SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Bosch, Hendrick J. 

Stiekema, Willem J.

(ii) TITLE OF INVENTION: Hybrid Toxin 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Novartis Corporation 

(B) STREET: 3054 Cornwallis Road

(C) CITY: Research Triangle Park 

(D) STATE: NC 

(E) COUNTRY: USA 

(F) ZIP: 27709 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE:

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/602,737 

(B) FILING DATE: 21-FEB-1996 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Meigs, J. Timothy 

(B) REGISTRATION NUMBER: 38,241

(C) REFERENCE/DOCKET NUMBER: 130-4080/PCT/CIP

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 919-541-8587 

(B) TELEFAX: 919-541-8689 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3567

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG GAG GAA AAT AAT CAA AAT CAA TGC ATA CCT TAC AAT TGT TTA AGT 48
Met Glu Glu Asn Asn Gln Asn Gln Cys Ile Pro Tyr Asn Cys Leu Ser
1 5 10 15

AAT CCT GAA GAA GTA CTT TTG GAT GGA GAA CGG ATA TCA ACT GGT AAT 96
Asn Pro Glu Glu Val Leu Leu Asp Gly Glu Arg Ile Ser Thr Gly Asn
20 25 30

TCA TCA ATT GAT ATT TCT CTG TCA CTT GTT CAG TTT CTG GTA TCT AAC 144
Ser Ser Ile Asp Ile Ser Leu Ser Leu Val Gln Phe Leu Val Ser Asn
35 40 45



TTT GTA CCA GGG GGA GGA TTT TTA GTT GGA TTA ATA GAT TTT GTA TGG 192
Phe Val Pro Gly Gly Gly Phe Leu Val Gly Leu Ile Asp Phe Val Trp
50 55 60

GGA ATA GTT GGC CCT TCT CAA TGG GAT GCA TTT CTA GTA CAA ATT GAA 240
Gly Ile Val Gly Pro Ser Gln Trp Asp Ala Phe Leu Val Gln Ile Glu
65 70 75 80

CAA TTA ATT AAT GAA AGA ATA GCT GAA TTT GCT AGG AAT GCT GCT ATT 288
Gln Leu Ile Asn Glu Arg Ile Ala Glu Phe Ala Arg Asn Ala Ala Ile
85 90 95

GCT AAT TTA GAA GGA TTA GGA AAC AAT TTC AAT ATA TAT GTG GAA GCA 336
Ala Asn Leu Glu Gly Leu Gly Asn Asn Phe Asn Ile Tyr Val Glu Ala
100 105 110

TTT AAA GAA TGG GAA GAA GAT CCT AAT AAT CCA GAA ACC AGG ACC AGA 384
Phe Lys Glu Trp Glu Glu Asp Pro Asn Asn Pro Glu Thr Arg Thr Arg
115 120 125

GTA ATT GAT CGC TTT CGT ATA CTT GAT GGG CTA CTT GAA AGG GAC ATT 432
Val Ile Asp Arg Phe Arg Ile Leu Asp Gly Leu Leu Glu Arg Asp Ile
130 135 140

CCT TCG TTT CGA ATT TCT GGA TTT GAA GTA CCC CTT TTA TCC GTT TAT 480
Pro Ser Phe Arg Ile Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr
145 150 155 160

GCT CAA GCG GCC AAT CTG CAT CTA GCT ATA TTA AGA GAT TCT GTA ATT 528
Ala Gln Ala Ala Asn Leu His Leu Ala Ile Leu Arg Asp Ser Val Ile
165 170 175

TTT GGA GAA AGA TGG GGA TTG ACA ACG ATA AAT GTC AAT GAA AAC TAT 576
Phe Gly Glu Arg Trp Gly Leu Thr Thr Ile Asn Val Asn Glu Asn Tyr
180 185 190

AAT AGA CTA ATT AGG CAT ATT GAT GAA TAT GCT GAT CAC TGT GCA AAT 624
Asn Arg Leu Ile Arg His Ile Asp Glu Tyr Ala Asp His Cys Ala Asn
195 200 205

ACG TAT AAT CGG GGA TTA AAT AAT TTA CCG AAA TCT ACG TAT CAA GAT 672
Thr Tyr Asn Arg Gly Leu Asn Asn Leu Pro Lys Ser Thr Tyr Gln Asp
210 215 220

TGG ATA ACA TAT AAT CGA TTA CGG AGA GAC TTA ACA TTG ACT GTA TTA 720
Trp Ile Thr Tyr Asn Arg Leu Arg Arg Asp Leu Thr Leu Thr Val Leu
225 230 235 240

GAT ATC GCC GCT TTC TTT CCA AAC TAT GAC AAT AGG AGA TAT CCA ATT 768
Asp Ile Ala Ala Phe Phe Pro Asn Tyr Asp Asn Arg Arg Tyr Pro Ile
245 250 255

CAG CCA GTT GGT CAA CTA ACA AGG GAA GTT TAT ACG GAC CCA TTA ATT 816
Gln Pro Val Gly Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Leu Ile
260 265 270

^,p. ipip^ pJ^rJ^ QQj^ QpjQ .prp;^ CAG TCT GTA GCT CAA TTA CCT ACT TTT AAC 864 
Asn Phe Asn Pro Gln Leu Gln Ser Val Ala Gln Leu Pro Thr Phe Asn
275 280 285

GTT ATG GAG AGC AGC GCA ATT AGA AAT CCT CAT TTA TTT GAT ATA TTG 912
Val Met Glu Ser Ser Ala Ile Arg Asn Pro His Leu Phe Asp Ile Leu
290 295 300

AAT AAT CTT ACA ATC TTT ACG GAT TGG TTT AGT GTT GGA CGC AAT TTT 960
Asn Asn Leu Thr Ile Phe Thr Asp Trp Phe Ser Val Gly Arg Asn Phe
305 310 315 320

TAT TGG GGA GGA CAT CGA GTA ATA TCT AGC CTT ATA GGA GGT GGT AAC 1008
Tyr Trp Gly Gly His Arg Val Ile Ser Ser Leu Ile Gly Gly Gly Asn
325 330 335

ATA ACA TCT CCT ATA TAT GGA AGA GAG GCG AAC CAG GAG CCT CCA AGA 1056
Ile Thr Ser Pro Ile Tyr Gly Arg Glu Ala Asn Gln Glu Pro Pro Arg
340 345 350

TCC TTT ACT TTT AAT GGA CCG GTA TTT AGG ACT TTA TCA AAT CCT ACT 1104
Ser Phe Thr Phe Asn Gly Pro Val Phe Arg Thr Leu Ser Asn Pro Thr
355 360 365

TTA CGA TTA TTA CAG CAA CCT TGG CCA GCG CCA CCA TTT AAT TTA CGT 1152
Leu Arg Leu Leu Gln Gln Pro Trp Pro Ala Pro Pro Phe Asn Leu Arg
370 375 380

GGT GTT GAA GGA GTA GAA TTT TCT ACA CCT ACA AAT AGC TTT ACG TAT 1200
Gly Val Glu Gly Val Glu Phe Ser Thr Pro Thr Asn Ser Phe Thr Tyr
385 390 395 400

CGA GGA AGA GGT ACG GTT GAT TCT TTA ACT GAA TTA CCG CCT GAG GAT 1248
Arg Gly Arg Gly Thr Val Asp Ser Leu Thr Glu Leu Pro Pro Glu Asp
405 410 415

AAT AGT GTG CCA CCT CGC GAA GGA TAT AGT CAT CGT TTA TGT CAT GCA 1296
Asn Ser Val Pro Pro Arg Glu Gly Tyr Ser His Arg Leu Cys His Ala
420 425 430



ACT TTT GTT CAA AGA TCT GGA ACA CCT TTT TTA ACA ACT GGT GTA GTA 1344
Thr Phe Val Gln Arg Ser Gly Thr Pro Phe Leu Thr Thr Gly Val Val
435 440 445

TTT TCT TGG ACG CAT CGT AGT GCA ACT CTT ACA AAT ACA ATT GAT CCA 1392
Phe Ser Trp Thr His Arg Ser Ala Thr Leu Thr Asn Thr Ile Asp Pro
450 455 460

GAG AGA ATT AAT CAA ATA CCT TTA GTG AAA GGA TTT AGA GTT TGG GGG 1440
Glu Arg Ile Asn Gln Ile Pro Leu Val Lys Gly Phe Arg Val Trp Gly
465 470 475 480

GGC ACC TCT GTC ATT ACA GGA CCA GGA TTT ACA GGA GGG GAT ATC CTT 1488
Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu
485 490 495

CGA AGA AAT ACC TTT GGT GAT TTT GTA TCT CTA CAA GTC AAT ATT AAT 1536
Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gln Val Asn Ile Asn
500 505 510



TCA CCA ATT AGC CAA AGA TAC CGT TTA AGA TTT CGT TAC GCT TCC AGT 1584
Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser
515 520 525

AGG GAT GCA CGA GTT ATA GTA TTA ACA GGA GCG GCA TCC ACA GGA GTG 1632
Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala Ser Thr Gly Val
530 535 540

GGA GGC CAA GTT AGT GTA AAT ATG CCT CTT CAG AAA ACT ATG GAA ATA 1680
Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln Lys Thr Met Glu Ile
545 550 555 560

GGG GAG AAC TTA ACA TCT AGA ACA TTT AGA TAT ACC GAT TTT AGT AAT 1728
Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr Asp Phe Ser Asn
565 570 575



CCT TTT TCA TTT AGA GCT AAT CCA GAT ATA ATT GGG ATA AGT GAA CAA 1776
Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile Gly Ile Ser Glu Gln
580 585 590

CCT CTA TTT GGT GCA GGT TCT ATT AGT AGC GGT GAA CTT TAT ATA GAT 1824
Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu Leu Tyr Ile Asp
595 600 605

AAA ATT GAA ATT ATT CTA GCA GAT GCA ACA TTT GAA GCA GAA TCT GAT 1872
Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe Glu Ala Glu Ser Asp
610 615 620

TTA GAA AGA GCA CAA AAG GCG GTG AAT GCC CTG TTT ACT TCT TCC AAT 1920
Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn
625 630 635 640

CAA ATC GGG TTA AAA ACC GAT GTG ACG GAT TAT CAT ATT GAT CAA GTA 1968
Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val
645 650 655

TCC AAT TTA GTG GAT TGT TTA TCA GAT GAA TTT TGT CTG GAT GAA AAG 2016 
Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys 
660 665 670 

CGA GAA TTG TCC GAG AAA GTC AAA CAT GCG AAG CGA CTC AGT GAT GAG 2064 
Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu 
675 680 685 

CGG AAT TTA CTT CAA GAT CCA AAC TTC AGA GGG ATC AAT AGA CAA CCA 2112 
Arg Asn Leu Leu Gin Asp Pro Asn Phe Arg Gly He Asn Arg Gin Pro 
690 695 700 

GAC CGT GGC TGG AGA GGA AGT ACA GAT ATT ACC ATC CAA GGA GGA GAT 2160 
Asp Arg Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asp 
705 710 715 720 

GAC GTA TTC AAA GAG AAT TAC GTC ACA CTA CCG GGT ACC GTT GAT GAG 2208 
Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu 
725 730 735 

TGC TAT CCA ACG TAT TTA TAT CAG AAA ATA GAT GAG TCG AAA TTA AAA 2256 
Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys 
740 745 750 

GCT TAT ACC CGT TAT GAA TTA AGA GGG TAT ATC GAA GAT AGT CAA GAC 2304 
Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp 
755 760 765 

TTA GAA ATC TAT TTG ATC CGT TAC AAT GCA AAA CAC GAA ATA GTA AAT 2352
Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Ile Val Asn
770 775 780

GTG CCA GGC ACG GGT TCC TTA TGG CCG CTT TCA GCC CAA AGT CCA ATC 2400
Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro Ile
785 790 795 800

GGA AAG TGT GGA GAA CCG AAT CGA TGC GCG CCA CAC CTT GAA TGG AAT 2448
Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn
805 810 815

CCT GAT CTA GAT TGT TCC TGC AGA GAC GGG GAA AAA TGT GCA CAT CAT 2496 
Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His 
820 825 830 

TCC CAT CAT TTC ACC TTG GAT ATT GAT GTT GGA TGT ACA GAC TTA AAT 2544 
Ser His His Phe Thr Leu Asp He Asp Val Gly Cys Thr Asp Leu Asn 
835 840 845 

GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATT AAG ACG CAA GAT GGC 2592
Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly
850 855 860

CAT GCA AGA CTA GGG AAT CTA GAG TTT CTC GAA GAG AAA CCA TTA TTA 2640
His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu
865 870 875 880

GGG GAA GCA CTA GCT CGT GTG AAA AGA GCG GAG AAG AAG TGG AGA GAC 2688
Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp
885 890 895



AAA CGA GAG AAA CTG CAG TTG GAA ACA AAT ATT GTT TAT AAA GAG GCA 2736 
Lys Arg Glu Lys Leu Gin Leu Glu Thr Asn He Val Tyr Lys Glu Ala 
900 905 910 

AAA GAA TCT GTA GAT GCT TTA TTT GTA AAC TCT CAA TAT GAT AGA TTA 2784 
Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Arg Leu 
915 920 925 

CAA GTG GAT ACG AAC ATC GCG ATG ATT CAT GCG GCA GAT AAA CGC GTT 2832 
Gln Val Asp Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg Val
930 935 940 

CAT AGA ATC CGG GAA GCG TAT CTG CCA GAG TTG TCT GTG ATT CCA GGT 2880 
His Arg He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly 
945 950 955 960 

GTC AAT GCG GCC ATT TTC GAA GAA TTA GAG GGA CGT ATT TTT ACA GCG 2928 
Val Asn Ala Ala He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala 
965 970 975 

TAT TCC TTA TAT GAT GCG AGA AAT GTC ATT AAA AAT GGC GAT TTC AAT 2976 
Tyr Ser Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn 
980 985 990 

AAT GGC TTA TTA TGC TGG AAC GTG AAA GGT CAT GTA GAT GTA GAA GAG 3024
Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu
995 1000 1005

CAA AAC AAC CAC CGT TCG GTC CTT GTT ATC CCA GAA TGG GAG GCA GAA 3072 
Gin Asn Asn His Arg Ser Val Leu Val lie Pro Glu Trp Glu Ala Glu 
1010 1015 1020 

GTG TCA CAA GAG GTT CGT GTC TGT CCA GGT CGT GGC TAT ATC CTT CGT 3120 
Val Ser Gin Glu Val Arg Val Cys Pro Gly Arg Gly Tyr lie Leu Arg 
1025 1030 1035 1040 

GTC ACA GCA TAT AAA GAG GGA TAT GGA GAG GGC TGC GTA ACG ATC CAT 3168 
Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr lie His 
1045 1050 1055 

GAG ATC GAA GAG AAT ACA GAC GAA CTG AAA TTC AGC AAC TGT GTA GAA 3216 
Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu
1060 1065 1070

GAG GAA GTA TAT CCA AAC AAC ACA GTA ACG TGT AAT AAT TAT ACT GGG 3264 
Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr Gly 
1075 1080 1085 

ACT CAA GAA GAA TAT GAG GGT ACG TAC ACT TCT CGT AAT CAA GGA TAT 3312 
Thr Gin Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Gin Gly Tyr 
1090 1095 1100 

GAC GAA GCC TAT GGT AAT AAC CCT TCC GTA CCA GCT GAT TAC GCT TCA 3360 
Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala Ser 
1105 1110 1115 1120 

GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA GAG AAT CCT TGT 3408 
Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys 
1125 1130 1135 

GAA TCT AAC AGA GGC TAT GGG GAT TAC ACA CCA CTA CCG GCT GGT TAT 3456 
Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr 
1140 1145 1150

GTA ACA AAG GAT TTA GAG TAC TTC CCA GAG ACC GAT AAG GTA TGG ATT 3504 
Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp lie 
1155 1160 1165 

GAG ATC GGA GAA ACA GAA GGA ACA TTC ATC GTG GAT AGC GTG GAA TTA 3552 
Glu lie Gly Glu Thr Glu Gly Thr Phe He Val Asp Ser Val Glu Leu 
1170 1175 1180 

CTC CTT ATG GAG GAA 3567 

Leu Leu Met Glu Glu 

1185 
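
The listing above can be cross-checked mechanically. The following is a minimal Python sketch, not part of the original filing, that translates the first codon row of SEQ ID NO: 1 into three-letter residue codes and confirms that the stated CDS lengths (3567 and 3513 base pairs) correspond to the stated protein lengths (1189 and 1171 amino acids). The helper names `translate` and `residue_count` are hypothetical, and the codon table holds only the codons that occur in that first row.

```python
# Partial codon table: only the codons present in the first row of SEQ ID NO: 1.
# A full check of the listing would need the complete standard genetic code.
CODON_TO_RESIDUE = {
    "ATG": "Met", "GAG": "Glu", "GAA": "Glu", "AAT": "Asn", "CAA": "Gln",
    "TGC": "Cys", "ATA": "Ile", "CCT": "Pro", "TAC": "Tyr", "TGT": "Cys",
    "TTA": "Leu", "AGT": "Ser",
}


def translate(codon_row):
    """Translate a whitespace-separated row of codons into three-letter residues."""
    return [CODON_TO_RESIDUE[codon] for codon in codon_row.split()]


def residue_count(cds_length_bp):
    """Residues encoded by a CDS that, as in this listing, carries no stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3


first_row = "ATG GAG GAA AAT AAT CAA AAT CAA TGC ATA CCT TAC AAT TGT TTA AGT"
print(" ".join(translate(first_row)))
print(residue_count(3567), residue_count(3513))
```

The same comparison, run row by row with a complete codon table, is one way to spot character-recognition errors in a scanned listing, since a misread base usually breaks the agreement between the codon row and the residue row beneath it.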



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1189 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Glu Asn Asn Gln Asn Gln Cys Ile Pro Tyr Asn Cys Leu Ser
1 5 10 15

Asn Pro Glu Glu Val Leu Leu Asp Gly Glu Arg He Ser Thr Gly Asn 
20 25 30 

Ser Ser He Asp He Ser Leu Ser Leu Val Gin Phe Leu Val Ser Asn 
35 40 45 

Phe Val Pro Gly Gly Gly Phe Leu Val Gly Leu He Asp Phe Val Trp 
50 55 60 

Gly He Val Gly Pro Ser Gin Trp Asp Ala Phe Leu Val Gin He Glu 
65 70 75 80 

Gin Leu He Asn Glu Arg He Ala Glu Phe Ala Arg Asn Ala Ala He 
85 90 95 

Ala Asn Leu Glu Gly Leu Gly Asn Asn Phe Asn He Tyr Val Glu Ala 
100 105 110 

Phe Lys Glu Trp Glu Glu Asp Pro Asn Asn Pro Glu Thr Arg Thr Arg 
115 120 125 

Val He Asp Arg Phe Arg He Leu Asp Gly Leu Leu Glu Arg Asp He 
130 135 140 

Pro Ser Phe Arg Ile Ser Gly Phe Glu Val Pro Leu Leu Ser Val Tyr
145 150 155 160 

Ala Gin Ala Ala Asn Leu His Leu Ala He Leu Arg Asp Ser Val He 
165 170 175

Phe Gly Glu Arg Trp Gly Leu Thr Thr Ile Asn Val Asn Glu Asn Tyr
180 185 190 

Asn Arg Leu He Arg His He Asp Glu Tyr Ala Asp His Cys Ala Asn 
195 200 205 

Thr Tyr Asn Arg Gly Leu Asn Asn Leu Pro Lys Ser Thr Tyr Gin Asp 
210 215 220 

Trp He Thr Tyr Asn Arg Leu Arg Arg Asp Leu Thr Leu Thr Val Leu 
225 230 235 240

Asp He Ala Ala Phe Phe Pro Asn Tyr Asp Asn Arg Arg Tyr Pro He 
245 250 255 



Gin Pro Val Gly Gin Leu Thr Arg Glu Val Tyr Thr Asp Pro Leu He 
260 265 270 

Asn Phe Asn Pro Gin Leu Gin Ser Val Ala Gin Leu Pro Thr Phe Asn 
275 280 285 

Val Met Glu Ser Ser Ala He Arg Asn Pro His Leu Phe Asp He Leu 
290 295 300

Asn Asn Leu Thr He Phe Thr Asp Trp Phe Ser Val Gly Arg Asn Phe 
305 310 315 320

Tyr Trp Gly Gly His Arg Val He Ser Ser Leu He Gly Gly Gly Asn 
325 330 335 

He Thr Ser Pro He Tyr Gly Arg Glu Ala Asn Gin Glu Pro Pro Arg 
340 345 350 

Ser Phe Thr Phe Asn Gly Pro Val Phe Arg Thr Leu Ser Asn Pro Thr 
355 360 365 

Leu Arg Leu Leu Gin Gin Pro Trp Pro Ala Pro Pro Phe Asn Leu Arg 
370 375 380 

Gly Val Glu Gly Val Glu Phe Ser Thr Pro Thr Asn Ser Phe Thr Tyr 
385 390 395 400 

Arg Gly Arg Gly Thr Val Asp Ser Leu Thr Glu Leu Pro Pro Glu Asp 
405 410 415 

Asn Ser Val Pro Pro Arg Glu Gly Tyr Ser His Arg Leu Cys His Ala 
420 425 430

Thr Phe Val Gin Arg Ser Gly Thr Pro Phe Leu Thr Thr Gly Val Val 
435 440 445 

Phe Ser Trp Thr His Arg Ser Ala Thr Leu Thr Asn Thr Ile Asp Pro
450 455 460 

Glu Arg He Asn Gin He Pro Leu Val Lys Gly Phe Arg Val Trp Gly 
465 470 475 480 

Gly Thr Ser Val He Thr Gly Pro Gly Phe Thr Gly Gly Asp He Leu 
485 490 495 

Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gin Val Asn He Asn 
500 505 510

Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser
515 520 525

Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala Ala Ser Thr Gly Val
530 535 540

Gly Gly Gin Val Ser Val Asn Met Pro Leu Gin Lys Thr Met Glu He 
545 550 555 560 

Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr Asp Phe Ser Asn 
565 570 575 

Pro Phe Ser Phe Arg Ala Asn Pro Asp He He Gly He Ser Glu Gin 
580 585 590 

Pro Leu Phe Gly Ala Gly Ser He Ser Ser Gly Glu Leu Tyr He Asp 
595 600 605 

Lys He Glu He He Leu Ala Asp Ala Thr Phe Glu Ala Glu Ser Asp 
610 615 620 

Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn
625 630 635 640 

Gin He Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin Val 
645 650 655 

Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys 
660 665 670 

Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu 
675 680 685 

Arg Asn Leu Leu Gin Asp Pro Asn Phe Arg Gly He Asn Arg Gin Pro 
690 695 700 

Asp Arg Gly Trp Arg Gly Ser Thr Asp He Thr He Gin Gly Gly Asp 
705 710 715 720

Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu
725 730 735 

Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys He Asp Glu Ser Lys Leu Lys 
740 745 750 

Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin Asp 
755 760 765 

Leu Glu He Tyr Leu He Arg Tyr Asn Ala Lys His Glu He Val Asn 
770 775 780 

Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro Ile
785 790 795 800

Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn
805 810 815

Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His
820 825 830

Ser His His Phe Thr Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn
835 840 845

Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly
850 855 860

His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu
865 870 875 880

Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp 
885 890 895 

Lys Arg Glu Lys Leu Gin Leu Glu Thr Asn lie Val Tyr Lys Glu Ala 
900 905 910 

Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Arg Leu 
915 920 925 

Gin Val Asp Thr Asn lie Ala Met lie His Ala Ala Asp Lys Arg Val 
930 935 940 

His Arg He Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val He Pro Gly 
945 950 955 960 

Val Asn Ala Ala He Phe Glu Glu Leu Glu Gly Arg He Phe Thr Ala 
965 970 975 

Tyr Ser Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe Asn 
980 985 990 

Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu
995 1000 1005 

Gin Asn Asn His Arg Ser Val Leu Val He Pro Glu Trp Glu Ala Glu 
1010 1015 1020 

Val Ser Gin Glu Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu Arg 
1025 1030 1035 1040 

Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr He His 
1045 1050 1055 

Glu He Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu 
1060 1065 1070 



Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr Gly
1075 1080 1085

Thr Gin Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Gin Gly Tyr 
1090 1095 1100 

Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala Ser 
1105 1110 1115 1120 

Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys 
1125 1130 1135 

Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr
1140 1145 1150 

Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile
1155 1160 1165

Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu
1170 1175 1180 

Leu Leu Met Glu Glu 
1185 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3513 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Bacillus thuringiensis 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..3513 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GAG ATA GTG AAT AAT CAG AAT CAA TGC GTG CCT TAT AAT TGT TTA 48
Met Glu Ile Val Asn Asn Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu
1 5 10 15

AAT AAT CCT GAA AAT GAG ATA TTA GAT ATT GAA AGG TCA AAT AGT ACT 96
Asn Asn Pro Glu Asn Glu Ile Leu Asp Ile Glu Arg Ser Asn Ser Thr
20 25 30

GTA GCA ACA AAC ATC GCC TTG GAG ATT AGT CGT CTG CTC GCT TCC GCA 144
Val Ala Thr Asn Ile Ala Leu Glu Ile Ser Arg Leu Leu Ala Ser Ala
35 40 45

ACT CCA ATA GGG GGG ATT TTA TTA GGA TTG TTT GAT GCA ATA TGG GGG 192
Thr Pro Ile Gly Gly Ile Leu Leu Gly Leu Phe Asp Ala Ile Trp Gly
50 55 60

TCT ATA GGC CCT TCA CAA TGG GAT TTA TTT TTA GAG CAA ATT GAG CTA 240
Ser Ile Gly Pro Ser Gln Trp Asp Leu Phe Leu Glu Gln Ile Glu Leu
65 70 75 80

TTG ATT GAC CAA AAA ATA GAG GAA TTC GCT AGA AAC CAG GCA ATT TCT 288
Leu Ile Asp Gln Lys Ile Glu Glu Phe Ala Arg Asn Gln Ala Ile Ser
85 90 95

AGA TTG GAA GGG ATA AGC AGT CTG TAC GGA ATT TAT ACA GAA GCT TTT 336
Arg Leu Glu Gly Ile Ser Ser Leu Tyr Gly Ile Tyr Thr Glu Ala Phe
100 105 110

AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AAA GAA GAG ATG 384
Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Lys Glu Glu Met
115 120 125

CGT ACT CAA TTT AAT GAC ATG AAC AGT ATT CTT GTA ACA GCT ATT CCT 432
Arg Thr Gln Phe Asn Asp Met Asn Ser Ile Leu Val Thr Ala Ile Pro
130 135 140

CTT TTT TCA GTT CAA AAT TAT CAA GTC CCA TTT TTA TCA GTA TAT GTT 480
Leu Phe Ser Val Gln Asn Tyr Gln Val Pro Phe Leu Ser Val Tyr Val
145 150 155 160

CAA GCT GCA AAT TTA CAT TTA TCG GTT TTG AGA GAT GTT TCA GTG TTT 528
Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser Val Phe
165 170 175

GGG CAG GCT TGG GGA TTT GAT ATA GCA ACA ATA AAT AGT CGT TAT AAT 576
Gly Gln Ala Trp Gly Phe Asp Ile Ala Thr Ile Asn Ser Arg Tyr Asn
180 185 190

GAT CTG ACT AGA CTT ATT CCT ATA TAT ACA GAT TAT GCT GTA CGC TGG 624
Asp Leu Thr Arg Leu Ile Pro Ile Tyr Thr Asp Tyr Ala Val Arg Trp
195 200 205

TAC AAT ACG GGA TTA GAT CGC TTA CCA CGA ACT GGT GGG CTG CGA AAC 672
Tyr Asn Thr Gly Leu Asp Arg Leu Pro Arg Thr Gly Gly Leu Arg Asn
210 215 220

TGG GCA AGA TTT AAT CAG TTT AGA AGA GAG TTA ACA ATA TCA GTA TTA 720
Trp Ala Arg Phe Asn Gln Phe Arg Arg Glu Leu Thr Ile Ser Val Leu
225 230 235 240

GAT ATT ATT TCT TTT TTC AGA AAT TAC GAT TCT AGA TTA TAT CCA ATT 768
Asp Ile Ile Ser Phe Phe Arg Asn Tyr Asp Ser Arg Leu Tyr Pro Ile
245 250 255

CCA ACA AGC TCC CAA TTA ACG CGG GAA GTA TAT ACA GAT CCG GTA ATT 816
Pro Thr Ser Ser Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Val Ile
260 265 270

AAT ATA ACT GAC TAT AGA GTT GGC CCC AGC TTC GAG AAT ATT GAG AAC 864
Asn Ile Thr Asp Tyr Arg Val Gly Pro Ser Phe Glu Asn Ile Glu Asn
275 280 285

TCA GCC ATT AGA AGC CCC CAC CTT ATG GAC TTC TTA AAT AAT TTG ACC 912
Ser Ala Ile Arg Ser Pro His Leu Met Asp Phe Leu Asn Asn Leu Thr
290 295 300

ATT GAT ACG GAT TTG ATT AGA GGT GTT CAC TAT TGG GCA GGG CAT CGT 960
Ile Asp Thr Asp Leu Ile Arg Gly Val His Tyr Trp Ala Gly His Arg
305 310 315 320

GTA ACT TCT CAT TTT ACA GGT AGT TCT CAA GTG ATA ACA ACC CCT CAA 1008
Val Thr Ser His Phe Thr Gly Ser Ser Gln Val Ile Thr Thr Pro Gln
325 330 335

TAT GGG ATA ACC GCA AAT GCG GAA CCA AGA CGA ACT ATT GCT CCT AGT 1056
Tyr Gly Ile Thr Ala Asn Ala Glu Pro Arg Arg Thr Ile Ala Pro Ser
340 345 350



ACT TTT CCA GGT CTT AAC CTA TTT TAT AGA ACA TTA TCA AAT CCT TTC 1104 
Thr Phe Pro Gly Leu Asn Leu Phe Tyr Arg Thr Leu Ser Asn Pro Phe 
355 360 365 

TTC CGA AGA TCA GAA AAT ATT ACT CCT ACC TTA GGG ATA AAT GTA GTA 1152 
Phe Arg Arg Ser Glu Asn Ile Thr Pro Thr Leu Gly Ile Asn Val Val
370 375 380 



CAG GGA GTA GGG TTC ATT CAA CCA AAT AAT GCT GAA GTT CTA TAT AGA 1200
Gln Gly Val Gly Phe Ile Gln Pro Asn Asn Ala Glu Val Leu Tyr Arg
385 390 395 400

AGT AGG GGG ACA GTA GAT TCT CTT AAT GAG TTA CCA ATT GAT GGT GAG 1248
Ser Arg Gly Thr Val Asp Ser Leu Asn Glu Leu Pro Ile Asp Gly Glu
405 410 415

AAT TCA TTA GTT GGA TAT AGT CAT CGA TTA AGT CAT GTT ACA CTA ACC 1296
Asn Ser Leu Val Gly Tyr Ser His Arg Leu Ser His Val Thr Leu Thr
420 425 430

AGG TCG TTA TAT AAT ACT AAT ATA ACT AGC CTG CCA ACA TTT GTT TGG 1344
Arg Ser Leu Tyr Asn Thr Asn Ile Thr Ser Leu Pro Thr Phe Val Trp
435 440 445

ACA CAT CAC AGT GCT ACT AAT ACA AAT ACA ATT AAT CCA GAT ATT ATT 1392
Thr His His Ser Ala Thr Asn Thr Asn Thr Ile Asn Pro Asp Ile Ile
450 455 460

ACA CAA ATA CCT TTA GTG AAA GGA TTT AGA CTT GGT GGT GGC ACC TCT 1440
Thr Gln Ile Pro Leu Val Lys Gly Phe Arg Leu Gly Gly Gly Thr Ser
465 470 475 480

GTC ATT AAA GGA CCA GGA TTT ACA GGA GGG GAT ATC CTT CGA AGA AAT 1488
Val Ile Lys Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Asn
485 490 495

ACC ATT GGT GAG TTT GTG TCT TTA CAA GTC AAT ATT AAC TCA CCA ATT 1536
Thr Ile Gly Glu Phe Val Ser Leu Gln Val Asn Ile Asn Ser Pro Ile
500 505 510

ACC CAA AGA TAC CGT TTA AGA TTT CGT TAT GCT TCC AGT AGG GAT GCA 1584
Thr Gln Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser Arg Asp Ala
515 520 525

CGA ATT ACT GTA GCG ATA GGA GGA CAA ATT AGA GTA GAT ATG ACC CTT 1632
Arg Ile Thr Val Ala Ile Gly Gly Gln Ile Arg Val Asp Met Thr Leu
530 535 540

GAA AAA ACC ATG GAA ATT GGG GAG AGC TTA ACA TCT AGA ACA TTT AGC 1680
Glu Lys Thr Met Glu Ile Gly Glu Ser Leu Thr Ser Arg Thr Phe Ser
545 550 555 560

TAT ACC AAT TTT AGT AAT CCT TTT TCA TTT AGG GCT AAT CCA GAT ATA 1728
Tyr Thr Asn Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile
565 570 575

ATT AGA ATA GCT GAA GAA CTT CCT ATT CGT GGT GGT GAG CTT TAT ATA 1776
Ile Arg Ile Ala Glu Glu Leu Pro Ile Arg Gly Gly Glu Leu Tyr Ile
580 585 590

GAT AAA ATT GAA CTT ATT CTA GCA GAT GCA ACA TTT GAA GAA GAA TAT 1824
Asp Lys Ile Glu Leu Ile Leu Ala Asp Ala Thr Phe Glu Glu Glu Tyr
595 600 605

GAT TTG GAA AGA GCA CAG AAG GCG GTG AAT GCC CTG TTT ACT TCT ACA 1872
Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Thr
610 615 620



AAT CAA CTA GGG CTA AAA ACA GAT GTG ACG GAT TAT CAT ATT GAT CAA 1920
Asn Gln Leu Gly Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln
625 630 635 640

GTT TCC AAT TTA GTT GAG TGT TTA TCG GAT GAA TTT TGT CTG GAT GAA 1968
Val Ser Asn Leu Val Glu Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu
645 650 655

AAG AGA GAA TTA TCC GAG AAA GTC AAA CAT GCG AAG CGA CTC AGT GAT 2016
Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp
660 665 670

GAA CGG AAT TTA CTT CAA GAT CCA AAC TTC AGA GGG ATC AAT AGG CAA 2064
Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln
675 680 685

CCA GAC CGT GGC TGG AGA GGA AGC ACG GAT ATT ACT ATC CAA GGT GGA 2112
Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly
690 695 700

GAT GAC GTA TTC AAA GAG AAT TAC GTC ACA TTA CCG GGT ACC TTT GAT 2160
Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp
705 710 715 720

GAG TGC TAT CCA ACG TAT TTA TAT CAA AAA ATA GAT GAG TCG AAG TTA 2208
Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu
725 730 735

AAA GCT TAT ACC CGC TAT GAA TTA AGA GGG TAT ATC GAG GAT AGT CAA 2256 
Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu Asp Ser Gin 
740 745 750 

GAC TTA GAA ATC TAT TTA ATT CGC TAC AAT GCA AAA CAC GAG ACA GTA 2304 
Asp Leu Glu He Tyr Leu He Arg Tyr Asn Ala Lys His Glu Thr Val 
755 760 765 

AAC GTG CCA GGT ACG GGT TGC TTA TGG CCG CTT TCA GCC CAA AGT CCA 2352 
Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gin Ser Pro 
770 775 780 

ATC GGA AAG TGT GGA GAA CCG AAT CGA TGC GCG CCA CAC CTT GAA TGG 2400 
He Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp 
785 790 795 800 

AAT CCT AAT CTA GAT TGC TCC TGC AGA GAC GGG GAA AAA TGT GCC CAT 2448 
Asn Pro Asn Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His 
805 810 815 

CAT TCC CAT CAT TTC TCC TTG GAC ATT GAT GTT GGA TGT ACA GAC TTA 2496 
His Ser His His Phe Ser Leu Asp He Asp Val Gly Cys Thr Asp Leu 
820 825 830 

AAT GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATT AAG ACA CAA GAT 2544 
Asn Glu Asp Leu Gly Val Trp Val He Phe Lys He Lys Thr Gin Asp 
835 840 845 

GGC TAT GCA AGA CTA GGA AAT CTA GAG TTT CTC GAA GAG AAC CCA CTA 2592 
Gly Tyr Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Asn Pro Leu 
850 855 860 

TTA GGG GAA GCA CTA GCT CGT GTG AAA AGA GCG GAG AAA AAA TGG AGA 2640 
Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg 
865 870 875 880



GAC AAA TGC GAA AAA TTG GAA TGG GAA ACA AAT ATT GTT TAT AAA GAG 2688
Asp Lys Cys Glu Lys Leu Glu Trp Glu Thr Asn Ile Val Tyr Lys Glu
885 890 895 

GCA AAA GAA TCT GTA GAT GCT TTA TTT GTA AAC TCT CAA TAT GAT AGA 2736 
Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin Tyr Asp Arg 
900 905 910 

TTA CAA GCG GAT ACG AAT ATC GCG ATG ATT CAT GCG GCA GAT AAA CGC 2784 
Leu Gin Ala Asp Thr Asn He Ala Met He His Ala Ala Asp Lys Arg 
915 920 925 

GTT CAT AGC ATT CGA GAA GCG TAT CTG CCA GAG CTG TCT GTG ATT CCG 2832
Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro
930 935 940

GGT GTC AAT GCG GCT ATT TTT GAA GAA TTA GAA GGG CGT ATT TTC ACT 2880
Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr
945 950 955 960



GCA TTC TCC CTA TAT GAT GCG AGA AAT GTC ATT AAA AAT GGC GAT TTC 2928 
Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val He Lys Asn Gly Asp Phe 
965 970 975 

AAT AAT GGC TTA TCA TGC TGG AAC GTG AAA GGG CAT GTA GAT GTA GAA 2976 
Asn Asn Gly Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu 
980 985 990 

GAA CAG AAC AAC CAT CGT TCG GTC CTT GTT GTT CCA GAA TGG GAA GCA 3024 
Glu Gln Asn Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala
995 1000 1005 

GAA GTG TCA CAA GAA GTT CGT GTT TGT CCG GGT CGT GGC TAT ATC CTT 3072 
Glu Val Ser Gin Glu Val Arg Val Cys Pro Gly Arg Gly Tyr He Leu 
1010 1015 1020 

CGT GTT ACA GCG TAC AAA GAG GGA TAT GGA GAG GGC TGT GTA ACG ATT 3120
Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile
1025 1030 1035 1040

CAT GAG ATC GAA GAC AAT ACA GAC GAA CTG AAA TTC AGC AAC TGT GTA 3168
His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val
1045 1050 1055

GAA GAG GAA GTA TAT CCA AAC AAC ACG GTA ACG TGT AAT AAT TAT ACT 3216
Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr
1060 1065 1070

GCG ACT CAA GAA GAA CAT GAG GGT ACG TAC ACT TCC CGT AAT CGA GGA 3264 
Ala Thr Gin Glu Glu His Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly 
1075 1080 1085

TAT GAC GAA GCC TAT GAA AGC AAT TCT TCT GTA CAT GCG TCA GTC TAT 3312
Tyr Asp Glu Ala Tyr Glu Ser Asn Ser Ser Val His Ala Ser Val Tyr
1090 1095 1100

GAA GAA AAA TCG TAT ACA GAT AGA CGA AGA GAG AAT CCT TGT GAA TCT 3360
Glu Glu Lys Ser Tyr Thr Asp Arg Arg Arg Glu Asn Pro Cys Glu Ser
1105 1110 1115 1120

AAC AGA GGA TAT GGG GAT TAC ACA CCA CTA CCA GCT GGC TAT GTG ACA 3408
Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr
1125 1130 1135

AAA GAA TTA GAG TAC TTC CCA GAA ACC GAT AAG GTA TGG ATT GAG ATC 3456
Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile
1140 1145 1150

GGA GAA ACG GAA GGA ACA TTC ATC GTG GAC AGC GTG GAA TTA CTT CTT 3504
Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu
1155 1160 1165

ATG GAG GAA 3513
Met Glu Glu
1170



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1171 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Glu Ile Val Asn Asn Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu
1 5 10 15

Asn Asn Pro Glu Asn Glu Ile Leu Asp Ile Glu Arg Ser Asn Ser Thr
20 25 30

Val Ala Thr Asn Ile Ala Leu Glu Ile Ser Arg Leu Leu Ala Ser Ala
35 40 45

Thr Pro Ile Gly Gly Ile Leu Leu Gly Leu Phe Asp Ala Ile Trp Gly
50 55 60

Ser Ile Gly Pro Ser Gln Trp Asp Leu Phe Leu Glu Gln Ile Glu Leu
65 70 75 80

Leu Ile Asp Gln Lys Ile Glu Glu Phe Ala Arg Asn Gln Ala Ile Ser
85 90 95

Arg Leu Glu Gly He Ser Ser Leu Tyr Gly He Tyr Thr Glu Ala Phe 
100 105 110 

Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Lys Glu Glu Met 
115 120 125 

Arg Thr Gin Phe Asn Asp Met Asn Ser He Leu Val Thr Ala He Pro 
130 135 140 

Leu Phe Ser Val Gin Asn Tyr Gin Val Pro Phe Leu Ser Val Tyr Val 
145 150 155 160 

Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp^Val Ser Val Phe 
165 170 175 

Gly Gin Ala Trp Gly Phe Asp He Ala Thr He Asn Ser Arg Tyr Asn 
180 185 190 

Asp Leu Thr Arg Leu He Pro He Tyr Thr Asp Tyr Ala Val Arg Trp 
195 200 205 

Tyr Asn Thr Gly Leu Asp Arg Leu Pro Arg Thr Gly Gly Leu Arg Asn 
210 215 220 

Trp Ala Arg Phe Asn Gin Phe Arg Arg Glu Leu Thr He Ser Val Leu 
225 230 235 240 

Asp He He Ser Phe Phe Arg Asn Tyr Asp Ser Arg Leu Tyr Pro He 
245 250 255 

Pro Thr Ser Ser Gin Leu Thr Arg Glu Val Tyr Thr Asp Pro Val He 
260 265 270 

Asn He Thr Asp Tyr Arg Val Gly Pro Ser Phe Glu Asn He Glu Asn 
275 280 285 

Ser Ala He Arg Ser Pro His Leu Met Asp Phe Leu Asn Asn Leu Thr 
290 295 300 

He Asp Thr Asp Leu He Arg Gly Val His Tyr Trp Ala Gly His Arg 
305 310 315 320 

Val Thr Ser His Phe Thr Gly Ser Ser Gin Val He Thr Thr Pro Gin 
325 330 335 

Tyr Gly He Thr Ala Asn Ala Glu Pro Arg Arg Thr He Ala Pro Ser 
340 345 350 

Thr Phe Pro Gly Leu Asn Leu Phe Tyr Arg Thr Leu Ser Asn Pro Phe 
355 360 365 
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Phe Arg Arg Ser Glu Asn He Thr Pro Thr Leu Gly He Asn Val Val 
370 375 380 

Gin Gly Val Gly Phe He Gin Pro Asn Asn Ala Glu Val Leu Tyr Arg 
385 390 395 . 400 

Ser Arg Gly Thr Val Asp Ser Leu Asn Glu Leu Pro He Asp Gly Glu 
405 410 415 

Asn Ser Leu Val Gly Tyr Ser His Arg Leu Ser His Val Thr Leu Thr 
420 425 430 

Arg Ser Leu Tyr Asn Thr Asn He Thr Ser Leu Pro Thr Phe Val Trp 
435 440 445 

Thr His His Ser Ala Thr Asn Thr Asn Thr He Asn Pro Asp He He 
450 455 460 

Thr Gin He Pro Leu Val Lys Gly Phe Arg Leu Gly Gly Gly Thr Ser 
465 470 475 480 

Val He Lys Gly Pro Gly Phe Thr Gly Gly Asp He Leu Arg Arg Asn 
485 490 495 

Thr He Gly Glu Phe Val Ser Leu Gin Val Asn He Asn Ser Pro He 
500 505 510 

Thr Gin Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser Arg Asp Ala 
515 520 525 

Arg He Thr Val Ala He Gly Gly Gin He Arg Val Asp Met Thr Leu 
530 535 540 

Glu Lys Thr Met Glu He Gly Glu Ser Leu Thr Ser Arg Thr Phe Ser 
545 550 555 560 

Tyr Thr Asn Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp He 
565 570 575 

He Arg He Ala Glu Glu Leu Pro He Arg Gly Gly Glu Leu Tyr He 
580 585 590 

Asp Lys He Glu Leu He Leu Ala Asp Ala Thr Phe Glu Glu Glu Tyr 
595 600 605 

Asp Leu Glu Arg Ala Gin Lys Ala Val Asn Ala Leu Phe Thr Ser Thr 
610 615 620 

Asn Gin Leu Gly Leu Lys Thr Asp Val Thr Asp Tyr His He Asp Gin 
625 630 635 640 

Val Ser Asn Leu Val Glu Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu 
645 650 655 



Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp
660 665 670

Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln
675 680 685

Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly
690 695 700

Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Phe Asp
705 710 715 720

Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu
725 730 735

Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr Ile Glu Asp Ser Gln
740 745 750

Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Thr Val
755 760 765

Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro
770 775 780

Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp
785 790 795 800

Asn Pro Asn Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His
805 810 815

His Ser His His Phe Ser Leu Asp Ile Asp Val Gly Cys Thr Asp Leu
820 825 830

Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp
835 840 845

Gly Tyr Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu Asn Pro Leu
850 855 860

Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg
865 870 875 880

Asp Lys Cys Glu Lys Leu Glu Trp Glu Thr Asn Ile Val Tyr Lys Glu
885 890 895

Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp Arg
900 905 910

Leu Gln Ala Asp Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg
915 920 925

Val His Ser Ile Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro
930 935 940

Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr
945 950 955 960

Ala Phe Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp Phe
965 970 975

Asn Asn Gly Leu Ser Cys Trp Asn Val Lys Gly His Val Asp Val Glu
980 985 990

Glu Gln Asn Asn His Arg Ser Val Leu Val Val Pro Glu Trp Glu Ala
995 1000 1005

Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu
1010 1015 1020

Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile
1025 1030 1035 1040

His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val
1045 1050 1055

Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr
1060 1065 1070

Ala Thr Gln Glu Glu His Glu Gly Thr Tyr Thr Ser Arg Asn Arg Gly
1075 1080 1085

Tyr Asp Glu Ala Tyr Glu Ser Asn Ser Ser Val His Ala Ser Val Tyr
1090 1095 1100

Glu Glu Lys Ser Tyr Thr Asp Arg Arg Arg Glu Asn Pro Cys Glu Ser
1105 1110 1115 1120

Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr
1125 1130 1135

Lys Glu Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile
1140 1145 1150

Gly Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu
1155 1160 1165

Met Glu Glu
1170

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hybrid sequence 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3558 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ATG GAG ATA GTG AAT AAT CAG AAT CAA TGC GTG CCT TAT AAT TGT TTA 48
Met Glu Ile Val Asn Asn Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu
1 5 10 15

AAT AAT CCT GAA AAT GAG ATA TTA GAT ATT GAA AGG TCA AAT AGT ACT 96
Asn Asn Pro Glu Asn Glu Ile Leu Asp Ile Glu Arg Ser Asn Ser Thr
20 25 30

GTA GCA ACA AAC ATC GCC TTG GAG ATT AGT CGT CTG CTC GCT TCC GCA 144
Val Ala Thr Asn Ile Ala Leu Glu Ile Ser Arg Leu Leu Ala Ser Ala
35 40 45

ACT CCA ATA GGG GGG ATT TTA TTA GGA TTG TTT GAT GCA ATA TGG GGG 192
Thr Pro Ile Gly Gly Ile Leu Leu Gly Leu Phe Asp Ala Ile Trp Gly
50 55 60

TCT ATA GGC CCT TCA CAA TGG GAT TTA TTT TTA GAG CAA ATT GAG CTA 240
Ser Ile Gly Pro Ser Gln Trp Asp Leu Phe Leu Glu Gln Ile Glu Leu
65 70 75 80

TTG ATT GAC CAA AAA ATA GAG GAA TTC GCT AGA AAC CAG GCA ATT TCT 288
Leu Ile Asp Gln Lys Ile Glu Glu Phe Ala Arg Asn Gln Ala Ile Ser
85 90 95

AGA TTG GAA GGG ATA AGC AGT CTG TAC GGA ATT TAT ACA GAA GCT TTT 336
Arg Leu Glu Gly Ile Ser Ser Leu Tyr Gly Ile Tyr Thr Glu Ala Phe
100 105 110

AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AAA GAA GAG ATG 384
Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Lys Glu Glu Met
115 120 125

CGT ACT CAA TTT AAT GAC ATG AAC AGT ATT CTT GTA ACA GCT ATT CCT 432
Arg Thr Gln Phe Asn Asp Met Asn Ser Ile Leu Val Thr Ala Ile Pro
130 135 140

CTT TTT TCA GTT CAA AAT TAT CAA GTC CCA TTT TTA TCA GTA TAT GTT 480
Leu Phe Ser Val Gln Asn Tyr Gln Val Pro Phe Leu Ser Val Tyr Val
145 150 155 160

CAA GCT GCA AAT TTA CAT TTA TCG GTT TTG AGA GAT GTT TCA GTG TTT 528
Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser Val Phe
165 170 175

GGG CAG GCT TGG GGA TTT GAT ATA GCA ACA ATA AAT AGT CGT TAT AAT 576
Gly Gln Ala Trp Gly Phe Asp Ile Ala Thr Ile Asn Ser Arg Tyr Asn
180 185 190

GAT CTG ACT AGA CTT ATT CCT ATA TAT ACA GAT TAT GCT GTA CGC TGG 624
Asp Leu Thr Arg Leu Ile Pro Ile Tyr Thr Asp Tyr Ala Val Arg Trp
195 200 205

TAC AAT ACG GGA TTA GAT CGC TTA CCA CGA ACT GGT GGG CTG CGA AAC 672
Tyr Asn Thr Gly Leu Asp Arg Leu Pro Arg Thr Gly Gly Leu Arg Asn
210 215 220

TGG GCA AGA TTT AAT CAG TTT AGA AGA GAG TTA ACA ATA TCA GTA TTA 720
Trp Ala Arg Phe Asn Gln Phe Arg Arg Glu Leu Thr Ile Ser Val Leu
225 230 235 240

GAT ATT ATT TCT TTT TTC AGA AAT TAC GAT TCT AGA TTA TAT CCA ATT 768
Asp Ile Ile Ser Phe Phe Arg Asn Tyr Asp Ser Arg Leu Tyr Pro Ile
245 250 255

CCA ACA AGC TCC CAA TTA ACG CGG GAA GTA TAT ACA GAT CCG GTA ATT 816
Pro Thr Ser Ser Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Val Ile
260 265 270

AAT ATA ACT GAC TAT AGA GTT GGC CCC AGC TTC GAG AAT ATT GAG AAC 864
Asn Ile Thr Asp Tyr Arg Val Gly Pro Ser Phe Glu Asn Ile Glu Asn
275 280 285

TCA GCC ATT AGA AGC CCC CAC CTT ATG GAC TTC TTA AAT AAT TTG ACC 912
Ser Ala Ile Arg Ser Pro His Leu Met Asp Phe Leu Asn Asn Leu Thr
290 295 300

ATT GAT ACG GAT TTG ATT AGA GGT GTT CAC TAT TGG GCA GGG CAT CGT 960
Ile Asp Thr Asp Leu Ile Arg Gly Val His Tyr Trp Ala Gly His Arg
305 310 315 320

GTA ACT TCT CAT TTT ACA GGT AGT TCT CAA GTG ATA ACA ACC CCT CAA 1008
Val Thr Ser His Phe Thr Gly Ser Ser Gln Val Ile Thr Thr Pro Gln
325 330 335

TAT GGG ATA ACC GCA AAT GCG GAA CCA AGA CGA ACT ATT GCT CCT AGT 1056
Tyr Gly Ile Thr Ala Asn Ala Glu Pro Arg Arg Thr Ile Ala Pro Ser
340 345 350

ACT TTT CCA GGT CTT AAC CTA TTT TAT AGA ACA TTA TCA AAT CCT TTC 1104
Thr Phe Pro Gly Leu Asn Leu Phe Tyr Arg Thr Leu Ser Asn Pro Phe
355 360 365

TTC CGA AGA TCA GAA AAT ATT ACT CCT ACC TTA GGG ATA AAT GTA GTA 1152
Phe Arg Arg Ser Glu Asn Ile Thr Pro Thr Leu Gly Ile Asn Val Val
370 375 380

CAG GGA GTA GGG TTC ATT CAA CCA AAT AAT GCT GAA GTT CTA TAT AGA 1200
Gln Gly Val Gly Phe Ile Gln Pro Asn Asn Ala Glu Val Leu Tyr Arg
385 390 395 400

AGT AGG GGG ACA GTA GAT TCT CTT AAT GAG TTA CCA ATT GAT GGT GAG 1248
Ser Arg Gly Thr Val Asp Ser Leu Asn Glu Leu Pro Ile Asp Gly Glu
405 410 415

AAT TCA TTA GTT GGA TAT AGT CAT CGA TTA AGT CAT GTT ACA CTA ACC 1296
Asn Ser Leu Val Gly Tyr Ser His Arg Leu Ser His Val Thr Leu Thr
420 425 430

AGG TCG TTA TAT AAT ACT AAT ATA ACT AGC CTG CCA ACA TTT GTT TGG 1344
Arg Ser Leu Tyr Asn Thr Asn Ile Thr Ser Leu Pro Thr Phe Val Trp
435 440 445

ACA CAT CAC AGT GCT ACT AAT ACA AAT ACA ATT AAT CCA GAT ATT ATT 1392
Thr His His Ser Ala Thr Asn Thr Asn Thr Ile Asn Pro Asp Ile Ile
450 455 460

ACA CAA ATA CCT TTA GTG AAA GGA TTT AGA GTT TGG GGG GGC ACC TCT 1440
Thr Gln Ile Pro Leu Val Lys Gly Phe Arg Val Trp Gly Gly Thr Ser
465 470 475 480

GTC ATT ACA GGA CCA GGA TTT ACA GGA GGG GAT ATC CTT CGA AGA AAT 1488
Val Ile Thr Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Asn
485 490 495

ACC TTT GGT GAT TTT GTA TCT CTA CAA GTC AAT ATT AAT TCA CCA ATT 1536
Thr Phe Gly Asp Phe Val Ser Leu Gln Val Asn Ile Asn Ser Pro Ile
500 505 510

ACC CAA AGA TAC CGT TTA AGA TTT CGT TAC GCT TCC AGT AGG GAT GCA 1584
Thr Gln Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser Arg Asp Ala
515 520 525

CGA GTT ATA GTA TTA ACA GGA GCG GCA TCC ACA GGA GTG GGA GGC CAA 1632
Arg Val Ile Val Leu Thr Gly Ala Ala Ser Thr Gly Val Gly Gly Gln
530 535 540

GTT AGT GTA AAT ATG CCT CTT CAG AAA ACT ATG GAA ATA GGG GAG AAC 1680
Val Ser Val Asn Met Pro Leu Gln Lys Thr Met Glu Ile Gly Glu Asn
545 550 555 560

TTA ACA TCT AGA ACA TTT AGA TAT ACC GAT TTT AGT AAT CCT TTT TCA 1728
Leu Thr Ser Arg Thr Phe Arg Tyr Thr Asp Phe Ser Asn Pro Phe Ser
565 570 575

TTT AGA GCT AAT CCA GAT ATA ATT GGG ATA AGT GAA CAA CCT CTA TTT 1776
Phe Arg Ala Asn Pro Asp Ile Ile Gly Ile Ser Glu Gln Pro Leu Phe
580 585 590

GGT GCA GGT TCT ATT AGT AGC GGT GAA CTT TAT ATA GAT AAA ATT GAA 1824
Gly Ala Gly Ser Ile Ser Ser Gly Glu Leu Tyr Ile Asp Lys Ile Glu
595 600 605

ATT ATT CTA GCA GAT GCA ACA TTT GAA GCA GAA TCT GAT TTA GAA AGA 1872
Ile Ile Leu Ala Asp Ala Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg
610 615 620

GCA CAA AAG GCG GTG AAT GCC CTG TTT ACT TCT TCC AAT CAA ATC GGG 1920
Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gln Ile Gly
625 630 635 640

TTA AAA ACC GAT GTG ACG GAT TAT CAT ATT GAT CAA GTA TCC AAT TTA 1968
Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu
645 650 655

GTG GAT TGT TTA TCA GAT GAA TTT TGT CTG GAT GAA AAG CGA GAA TTG 2016
Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu
660 665 670

TCC GAG AAA GTC AAA CAT GCG AAG CGA CTC AGT GAT GAG CGG AAT TTA 2064
Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu
675 680 685

CTT CAA GAT CCA AAC TTC AGA GGG ATC AAT AGA CAA CCA GAC CGT GGC 2112
Leu Gln Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln Pro Asp Arg Gly
690 695 700

TGG AGA GGA AGT ACA GAT ATT ACC ATC CAA GGA GGA GAT GAC GTA TTC 2160
Trp Arg Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe
705 710 715 720

AAA GAG AAT TAC GTC ACA CTA CCG GGT ACC GTT GAT GAG TGC TAT CCA 2208
Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro
725 730 735

ACG TAT TTA TAT CAG AAA ATA GAT GAG TCG AAA TTA AAA GCT TAT ACC 2256
Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr
740 745 750

CGT TAT GAA TTA AGA GGG TAT ATC GAA GAT AGT CAA GAC TTA GAA ATC 2304
Arg Tyr Glu Leu Arg Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile
755 760 765

TAT TTG ATC CGT TAC AAT GCA AAA CAC GAA ATA GTA AAT GTG CCA GGC 2352
Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Ile Val Asn Val Pro Gly
770 775 780

ACG GGT TCC TTA TGG CCG CTT TCA GCC CAA AGT CCA ATC GGA AAG TGT 2400
Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro Ile Gly Lys Cys
785 790 795 800

GGA GAA CCG AAT CGA TGC GCG CCA CAC CTT GAA TGG AAT CCT GAT CTA 2448
Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu
805 810 815

GAT TGT TCC TGC AGA GAC GGG GAA AAA TGT GCA CAT CAT TCC CAT CAT 2496
Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His
820 825 830

TTC ACC TTG GAT ATT GAT GTT GGA TGT ACA GAC TTA AAT GAG GAC TTA 2544
Phe Thr Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu
835 840 845

GGT GTA TGG GTG ATA TTC AAG ATT AAG ACG CAA GAT GGC CAT GCA AGA 2592
Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg
850 855 860

CTA GGG AAT CTA GAG TTT CTC GAA GAG AAA CCA TTA TTA GGG GAA GCA 2640
Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala
865 870 875 880

CTA GCT CGT GTG AAA AGA GCG GAG AAG AAG TGG AGA GAC AAA CGA GAG 2688
Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu
885 890 895

AAA CTG CAG TTG GAA ACA AAT ATT GTT TAT AAA GAG GCA AAA GAA TCT 2736
Lys Leu Gln Leu Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser
900 905 910

GTA GAT GCT TTA TTT GTA AAC TCT CAA TAT GAT AGA TTA CAA GTG GAT 2784
Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Val Asp
915 920 925

ACG AAC ATC GCG ATG ATT CAT GCG GCA GAT AAA CGC GTT CAT AGA ATC 2832
Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg Val His Arg Ile
930 935 940

CGG GAA GCG TAT CTG CCA GAG TTG TCT GTG ATT CCA GGT GTC AAT GCG 2880
Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala
945 950 955 960

GCC ATT TTC GAA GAA TTA GAG GGA CGT ATT TTT ACA GCG TAT TCC TTA 2928
Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Tyr Ser Leu
965 970 975

TAT GAT GCG AGA AAT GTC ATT AAA AAT GGC GAT TTC AAT AAT GGC TTA 2976
Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu
980 985 990

TTA TGC TGG AAC GTG AAA GGT CAT GTA GAT GTA GAA GAG CAA AAC AAC 3024
Leu Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn
995 1000 1005

CAC CGT TCG GTC CTT GTT ATC CCA GAA TGG GAG GCA GAA GTG TCA CAA 3072
His Arg Ser Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser Gln
1010 1015 1020

GAG GTT CGT GTC TGT CCA GGT CGT GGC TAT ATC CTT CGT GTC ACA GCA 3120
Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala
1025 1030 1035 1040

TAT AAA GAG GGA TAT GGA GAG GGC TGC GTA ACG ATC CAT GAG ATC GAA 3168
Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu
1045 1050 1055

GAC AAT ACA GAC GAA CTG AAA TTC AGC AAC TGT GTA GAA GAG GAA GTA 3216
Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu Val
1060 1065 1070

TAT CCA AAC AAC ACA GTA ACG TGT AAT AAT TAT ACT GGG ACT CAA GAA 3264
Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr Gly Thr Gln Glu
1075 1080 1085

GAA TAT GAG GGT ACG TAC ACT TCT CGT AAT CAA GGA TAT GAC GAA GCC 3312
Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Gln Gly Tyr Asp Glu Ala
1090 1095 1100

TAT GGT AAT AAC CCT TCC GTA CCA GCT GAT TAC GCT TCA GTC TAT GAA 3360
Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu
1105 1110 1115 1120

GAA AAA TCG TAT ACA GAT GGA CGA AGA GAG AAT CCT TGT GAA TCT AAC 3408
Glu Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Ser Asn
1125 1130 1135

AGA GGC TAT GGG GAT TAC ACA CCA CTA CCG GCT GGT TAT GTA ACA AAG 3456
Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys
1140 1145 1150

GAT TTA GAG TAC TTC CCA GAG ACC GAT AAG GTA TGG ATT GAG ATC GGA 3504
Asp Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile Gly
1155 1160 1165

GAA ACA GAA GGA ACA TTC ATC GTG GAT AGC GTG GAA TTA CTC CTT ATG 3552
Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met
1170 1175 1180

GAG GAA 3558
Glu Glu
1185

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu Ile Val Asn Asn Gln Asn Gln Cys Val Pro Tyr Asn Cys Leu
1 5 10 15

Asn Asn Pro Glu Asn Glu Ile Leu Asp Ile Glu Arg Ser Asn Ser Thr
20 25 30

Val Ala Thr Asn Ile Ala Leu Glu Ile Ser Arg Leu Leu Ala Ser Ala
35 40 45

Thr Pro Ile Gly Gly Ile Leu Leu Gly Leu Phe Asp Ala Ile Trp Gly
50 55 60

Ser Ile Gly Pro Ser Gln Trp Asp Leu Phe Leu Glu Gln Ile Glu Leu
65 70 75 80

Leu Ile Asp Gln Lys Ile Glu Glu Phe Ala Arg Asn Gln Ala Ile Ser
85 90 95

Arg Leu Glu Gly Ile Ser Ser Leu Tyr Gly Ile Tyr Thr Glu Ala Phe
100 105 110

Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Lys Glu Glu Met
115 120 125

Arg Thr Gln Phe Asn Asp Met Asn Ser Ile Leu Val Thr Ala Ile Pro
130 135 140

Leu Phe Ser Val Gln Asn Tyr Gln Val Pro Phe Leu Ser Val Tyr Val
145 150 155 160

Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser Val Phe
165 170 175

Gly Gln Ala Trp Gly Phe Asp Ile Ala Thr Ile Asn Ser Arg Tyr Asn
180 185 190

Asp Leu Thr Arg Leu Ile Pro Ile Tyr Thr Asp Tyr Ala Val Arg Trp
195 200 205

Tyr Asn Thr Gly Leu Asp Arg Leu Pro Arg Thr Gly Gly Leu Arg Asn
210 215 220

Trp Ala Arg Phe Asn Gln Phe Arg Arg Glu Leu Thr Ile Ser Val Leu
225 230 235 240

Asp Ile Ile Ser Phe Phe Arg Asn Tyr Asp Ser Arg Leu Tyr Pro Ile
245 250 255

Pro Thr Ser Ser Gln Leu Thr Arg Glu Val Tyr Thr Asp Pro Val Ile
260 265 270

Asn Ile Thr Asp Tyr Arg Val Gly Pro Ser Phe Glu Asn Ile Glu Asn
275 280 285

Ser Ala Ile Arg Ser Pro His Leu Met Asp Phe Leu Asn Asn Leu Thr
290 295 300

Ile Asp Thr Asp Leu Ile Arg Gly Val His Tyr Trp Ala Gly His Arg
305 310 315 320

Val Thr Ser His Phe Thr Gly Ser Ser Gln Val Ile Thr Thr Pro Gln
325 330 335

Tyr Gly Ile Thr Ala Asn Ala Glu Pro Arg Arg Thr Ile Ala Pro Ser
340 345 350

Thr Phe Pro Gly Leu Asn Leu Phe Tyr Arg Thr Leu Ser Asn Pro Phe
355 360 365

Phe Arg Arg Ser Glu Asn Ile Thr Pro Thr Leu Gly Ile Asn Val Val
370 375 380

Gln Gly Val Gly Phe Ile Gln Pro Asn Asn Ala Glu Val Leu Tyr Arg
385 390 395 400

Ser Arg Gly Thr Val Asp Ser Leu Asn Glu Leu Pro Ile Asp Gly Glu
405 410 415

Asn Ser Leu Val Gly Tyr Ser His Arg Leu Ser His Val Thr Leu Thr
420 425 430

Arg Ser Leu Tyr Asn Thr Asn Ile Thr Ser Leu Pro Thr Phe Val Trp
435 440 445

Thr His His Ser Ala Thr Asn Thr Asn Thr Ile Asn Pro Asp Ile Ile
450 455 460

Thr Gln Ile Pro Leu Val Lys Gly Phe Arg Val Trp Gly Gly Thr Ser
465 470 475 480

Val Ile Thr Gly Pro Gly Phe Thr Gly Gly Asp Ile Leu Arg Arg Asn
485 490 495

Thr Phe Gly Asp Phe Val Ser Leu Gln Val Asn Ile Asn Ser Pro Ile
500 505 510

Thr Gln Arg Tyr Arg Leu Arg Phe Arg Tyr Ala Ser Ser Arg Asp Ala
515 520 525

Arg Val Ile Val Leu Thr Gly Ala Ala Ser Thr Gly Val Gly Gly Gln
530 535 540

Val Ser Val Asn Met Pro Leu Gln Lys Thr Met Glu Ile Gly Glu Asn
545 550 555 560

Leu Thr Ser Arg Thr Phe Arg Tyr Thr Asp Phe Ser Asn Pro Phe Ser
565 570 575

Phe Arg Ala Asn Pro Asp Ile Ile Gly Ile Ser Glu Gln Pro Leu Phe
580 585 590

Gly Ala Gly Ser Ile Ser Ser Gly Glu Leu Tyr Ile Asp Lys Ile Glu
595 600 605

Ile Ile Leu Ala Asp Ala Thr Phe Glu Ala Glu Ser Asp Leu Glu Arg
610 615 620

Ala Gln Lys Ala Val Asn Ala Leu Phe Thr Ser Ser Asn Gln Ile Gly
625 630 635 640

Leu Lys Thr Asp Val Thr Asp Tyr His Ile Asp Gln Val Ser Asn Leu
645 650 655

Val Asp Cys Leu Ser Asp Glu Phe Cys Leu Asp Glu Lys Arg Glu Leu
660 665 670

Ser Glu Lys Val Lys His Ala Lys Arg Leu Ser Asp Glu Arg Asn Leu
675 680 685

Leu Gln Asp Pro Asn Phe Arg Gly Ile Asn Arg Gln Pro Asp Arg Gly
690 695 700

Trp Arg Gly Ser Thr Asp Ile Thr Ile Gln Gly Gly Asp Asp Val Phe
705 710 715 720

Lys Glu Asn Tyr Val Thr Leu Pro Gly Thr Val Asp Glu Cys Tyr Pro
725 730 735

Thr Tyr Leu Tyr Gln Lys Ile Asp Glu Ser Lys Leu Lys Ala Tyr Thr
740 745 750

Arg Tyr Glu Leu Arg Gly Tyr Ile Glu Asp Ser Gln Asp Leu Glu Ile
755 760 765

Tyr Leu Ile Arg Tyr Asn Ala Lys His Glu Ile Val Asn Val Pro Gly
770 775 780

Thr Gly Ser Leu Trp Pro Leu Ser Ala Gln Ser Pro Ile Gly Lys Cys
785 790 795 800

Gly Glu Pro Asn Arg Cys Ala Pro His Leu Glu Trp Asn Pro Asp Leu
805 810 815

Asp Cys Ser Cys Arg Asp Gly Glu Lys Cys Ala His His Ser His His
820 825 830

Phe Thr Leu Asp Ile Asp Val Gly Cys Thr Asp Leu Asn Glu Asp Leu
835 840 845

Gly Val Trp Val Ile Phe Lys Ile Lys Thr Gln Asp Gly His Ala Arg
850 855 860

Leu Gly Asn Leu Glu Phe Leu Glu Glu Lys Pro Leu Leu Gly Glu Ala
865 870 875 880

Leu Ala Arg Val Lys Arg Ala Glu Lys Lys Trp Arg Asp Lys Arg Glu
885 890 895

Lys Leu Gln Leu Glu Thr Asn Ile Val Tyr Lys Glu Ala Lys Glu Ser
900 905 910

Val Asp Ala Leu Phe Val Asn Ser Gln Tyr Asp Arg Leu Gln Val Asp
915 920 925

Thr Asn Ile Ala Met Ile His Ala Ala Asp Lys Arg Val His Arg Ile
930 935 940

Arg Glu Ala Tyr Leu Pro Glu Leu Ser Val Ile Pro Gly Val Asn Ala
945 950 955 960

Ala Ile Phe Glu Glu Leu Glu Gly Arg Ile Phe Thr Ala Tyr Ser Leu
965 970 975

Tyr Asp Ala Arg Asn Val Ile Lys Asn Gly Asp Phe Asn Asn Gly Leu
980 985 990

Leu Cys Trp Asn Val Lys Gly His Val Asp Val Glu Glu Gln Asn Asn
995 1000 1005

His Arg Ser Val Leu Val Ile Pro Glu Trp Glu Ala Glu Val Ser Gln
1010 1015 1020

Glu Val Arg Val Cys Pro Gly Arg Gly Tyr Ile Leu Arg Val Thr Ala
1025 1030 1035 1040

Tyr Lys Glu Gly Tyr Gly Glu Gly Cys Val Thr Ile His Glu Ile Glu
1045 1050 1055

Asp Asn Thr Asp Glu Leu Lys Phe Ser Asn Cys Val Glu Glu Glu Val
1060 1065 1070

Tyr Pro Asn Asn Thr Val Thr Cys Asn Asn Tyr Thr Gly Thr Gln Glu
1075 1080 1085

Glu Tyr Glu Gly Thr Tyr Thr Ser Arg Asn Gln Gly Tyr Asp Glu Ala
1090 1095 1100

Tyr Gly Asn Asn Pro Ser Val Pro Ala Asp Tyr Ala Ser Val Tyr Glu
1105 1110 1115 1120

Glu Lys Ser Tyr Thr Asp Gly Arg Arg Glu Asn Pro Cys Glu Ser Asn
1125 1130 1135

Arg Gly Tyr Gly Asp Tyr Thr Pro Leu Pro Ala Gly Tyr Val Thr Lys
1140 1145 1150

Asp Leu Glu Tyr Phe Pro Glu Thr Asp Lys Val Trp Ile Glu Ile Gly
1155 1160 1165

Glu Thr Glu Gly Thr Phe Ile Val Asp Ser Val Glu Leu Leu Leu Met
1170 1175 1180

Glu Glu
1185

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hybrid toxin 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3579



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GAT AAC AAT CCG AAC ATC AAT GAA TGC ATT CCT TAT AAT TGT TTA 48
Met Asp Asn Asn Pro Asn Ile Asn Glu Cys Ile Pro Tyr Asn Cys Leu
1 5 10 15

AGT AAC CCT GAA GTA GAA GTA TTA GGT GGA GAA AGA ATA GAA ACT GGT 96
Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg Ile Glu Thr Gly
20 25 30

TAC ACC CCA ATC GAT ATT TCC TTG TCG CTA ACG CAA TTT CTT TTG AGT 144
Tyr Thr Pro Ile Asp Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser
35 40 45

GAA TTT GTT CCC GGT GCT GGA TTT GTG TTA GGA CTA GTT GAT ATA ATA 192
Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp Ile Ile
50 55 60

TGG GGA ATT TTT GGT CCC TCT CAA TGG GAC GCA TTT CTT GTA CAA ATT 240
Trp Gly Ile Phe Gly Pro Ser Gln Trp Asp Ala Phe Leu Val Gln Ile
65 70 75 80

GAA CAG TTA ATT AAC CAA AGA ATA GAA GAA TTC GCT AGG AAC CAA GCC 288
Glu Gln Leu Ile Asn Gln Arg Ile Glu Glu Phe Ala Arg Asn Gln Ala
85 90 95

ATT TCT AGA TTA GAA GGA CTA AGC AAT CTT TAT CAA ATT TAC GCA GAA 336
Ile Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gln Ile Tyr Ala Glu
100 105 110

TCT TTT AGA GAG TGG GAA GCA GAT CCT ACT AAT CCA GCA TTA AGA GAA 384
Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu
115 120 125

GAG ATG CGT ATT CAA TTC AAT GAC ATG AAC AGT GCC CTT ACA ACC GCT 432
Glu Met Arg Ile Gln Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala
130 135 140

ATT CCT CTT TTT GCA GTT CAA AAT TAT CAA GTT CCT CTT TTA TCA GTA 480
Ile Pro Leu Phe Ala Val Gln Asn Tyr Gln Val Pro Leu Leu Ser Val
145 150 155 160

TAT GTT CAA GCT GCA AAT TTA CAT TTA TCA GTT TTG AGA GAT GTT TCA 528
Tyr Val Gln Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser
165 170 175

GTG TTT GGA CAA AGG TGG GGA TTT GAT GCC GCG ACT ATC AAT AGT CGT 576
Val Phe Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr Ile Asn Ser Arg
180 185 190

TAT AAT GAT TTA ACT AGG CTT ATT GGC AAC TAT ACA GAT CAT GCT GTA 624
Tyr Asn Asp Leu Thr Arg Leu Ile Gly Asn Tyr Thr Asp His Ala Val
195 200 205

CGC TGG TAC AAT ACG GGA TTA GAG CGT GTA TGG GGA CCG GAT TCT AGA 672
Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg
210 215 220
GAT TGG ATA AGA TAT AAT CAA TTT AGA AGA GAA TTA ACA CTA ACT GTA 720 
Asp Trp Ile Arg Tyr Asn Gln Phe Arg Arg Glu Leu Thr Leu Thr Val
225 230 235 240

TTA GAT ATC GTT TCT CTA TTT CCG AAC TAT GAT AGT AGA ACG TAT CCA 768
Leu Asp Ile Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro
245 250 255

ATT CGA ACA GTT TCC CAA TTA ACA AGA GAA ATT TAT ACA AAC CCA GTA 816
Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val
260 265 270

TTA GAA AAT TTT GAT GGT AGT TTT CGA GGC TCG GCT CAG GGC ATA GAA 864
Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gln Gly Ile Glu
275 280 285

GGA AGT ATT AGG AGT CCA CAT TTG ATG GAT ATA CTT AAC AGT ATA ACC 912
Gly Ser Ile Arg Ser Pro His Leu Met Asp Ile Leu Asn Ser Ile Thr
290 295 300

ATC TAT ACG GAT GCT CAT AGA GGA GAA TAT TAT TGG TCA GGG CAT CAA 960
Ile Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gln
305 310 315 320

ATA ATG GCT TCT CCT GTA GGG TTT TCG GGG CCA GAA TTC ACT TTT CCG 1008
Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro
325 330 335

CTA TAT GGA ACT ATG GGA AAT GCA GCT CCA CAA CAA CGT ATT GTT GCT 1056
Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile Val Ala
340 345 350

CAA CTA GGT CAG GGC GTG TAT AGA ACA TTA TCG TCC ACT TTA TAT AGA 1104
Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg
355 360 365

AGA CCT TTT AAT ATA GGG ATA AAT AAT CAA CAA CTA TCT GTT CTT GAC 1152
Arg Pro Phe Asn Ile Gly Ile Asn Asn Gln Gln Leu Ser Val Leu Asp
370 375 380

GGG ACA GAA TTT GCT TAT GGA ACC TCC TCA AAT TTG CCA TCC GCT GTA 1200
Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val
385 390 395 400

TAC AGA AAA AGC GGA ACG GTA GAT TCG CTG GAT GAA ATA CCG CCA CAG 1248
Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu Ile Pro Pro Gln
405 410 415

AAT AAC AAC GTG CCA CCT AGG CAA GGA TTT AGT CAT CGA TTA AGC CAT 1296
Asn Asn Asn Val Pro Pro Arg Gln Gly Phe Ser His Arg Leu Ser His
420 425 430

GTT TCA ATG TTT CGT TCA GGC TTT AGT AAT AGT AGT GTA AGT ATA ATA 1344
Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser Ile Ile
435 440 445

AGA GCT CCT ATG TTC TCT TGG ATA CAT CGT AGT GCA ACT CTT ACA AAT 1392 
Arg Ala Pro Met Phe Ser Trp lie His Arg Ser Ala Thr Leu Thr Asn 
450 455 460 

ACA ATT GAT CCA GAG AGA ATT AAT CAA ATA CCT TTA GTG AAA GGA TTT 1440 
Thr lie Asp Pro Glu Arg lie Asn Gin lie Pro Leu Val Lys Gly Phe 
465 470 475 480 

AGA GTT TGG GGG GGC ACC TCT GTC ATT ACA GGA CCA GGA TTT ACA GGA 1488 
Arg Val Trp Gly Gly Thr Ser Val lie Thr Gly Pro Gly Phe Thr Gly 
485 490 495 



GGG GAT ATC CTT CGA AGA AAT ACC TTT GGT GAT TTT-^GTA TCT CTA CAA 1536 
y Asp 



Gly Asp lie Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gin 



500 505 510 

GTC AAT ATT AAT TCA CCA ATT ACC CAA AGA TAC CGT TTA AGA TTT CGT I'S^ 
Val Asn lie Asn Ser Pro lie Thr Gin Arg Tyr Arg Leu Arg Phe Arg 
515 520 525 

TAC GCT TCC AGT AGG GAT GCA CGA GTT ATA GTA TTA ACA GGA GCG GCA 1632 
Tyr Ala Ser Ser Arg Asp Ala Arg Val lie Val Leu Thr Gly Ala Ala 
530 535 540 

TCC ACA GGA GTG GGA GGC CAA GTT AGT GTA AAT ATG CCT CTT CAG AAA 1680 
Ser Thr Gly Val Gly Gly Gin Val Ser Val Asn Met Pro Leu Gin Lys 
545 550 555 560 

ACT ATG GAA ATA GGG GAG AAC TTA ACA TCT AGA ACA TTT AGA TAT ACC 1728 
Thr Met Glu lie Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr 
565 570 575 

GAT TTT AGT AAT CCT TTT TCA TTT AGA GCT AAT CCA GAT ATA ATT GGG 1776 
Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp lie lie Gly 
580 585 590 

ATA AGT GAA CAA CCT CTA TTT GGT GCA GGT TCT ATT AGT AGC GGT GAA 1824 
lie Ser Glu Gin Pro Leu Phe Gly Ala Gly Ser lie Ser Ser Gly Glu 
595 600 605 

CTT TAT ATA GAT AAA ATT GAA ATT ATT CTA GCA GAT GCA ACA TTT GAA 1872 
Leu Tyr lie Asp Lys lie Glu lie lie Leu Ala Asp Ala Thr Phe Glu 
610 615 620 

GCA GAA TCT GAT TTA GAA AGA GCA CAA AAG GCG GTG AAT GCC CTG TTT 1920 
Ala Glu Ser Asp Leu Glu Arg Ala Gin Lys Ala Val Asn Ala Leu Phe 
625 630 635 640 

ACT TCT TCC AAT C^ ATC GGG TTA AAA ACC GAT GTG " ACG GAT TAT CAT 1968 
Thr Ser Ser Asn Gin lie Gly Leu Lys Thr Asp Val Thr Asp Tyr His 
645 650 655 
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ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA GAT GAA TTT TGT 2016 
lie Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe Cys 
660 665 670 

CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA CAT GCG AAG CGA 2064 
Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg 
675 680 685 

CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC TTC AGA GGG ATC 2112 
Leu Ser Asp Glu Arg Asn Leu Leu Gin Asp Pro Asn Phe Arg Gly lie 
690 695 700 

AAT AGA CAA CCA GAC CGT GGC TGG AGA GGA AGT ACA GAT ATT ACC ATC 2160 
Asn Arg Gin Pro Asp Arg Gly Trp Arg Gly Ser Thr^Asp He Thr He 
7G5 710 715 720 

CAA GGA GGA GAT GAC GTA TTC AAA GAG AAT TAC GTC ACA CTA CCG GGT 22JD.8 
Gin Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly ' 
725 730 735 

ACC GTT GAT GAG TGC TAT CCA ACG TAT TTA TAT CAG AAA ATA GAT GAG 2256 
Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys lie Asp Glu 
740 745 750 

TCG AAA TTA AAA GCT TAT ACC CGT TAT GAA TTA AGA GGG TAT ATC GAA 2304 
Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr He Glu 
755 760 765 

GAT AGT CAA GAC TTA GAA ATC TAT TTG ATC CGT TAC AAT GCA AAA CAC 2352 
Asp Ser Gin Asp Leu Glu He Tyr Leu He Arg Tyr Asn Ala Lys His 
770 775 780 

GAA ATA GTA AAT GTG CCA GGC ACG GGT TCC TTA TGG CCG CTT TCA GCC 2400 
Glu He Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala 
785 790 795 800 

CAA AGT CCA ATC GGA AAG TGT GGA GAA CCG AAT CGA TGC GCG CCA CAC 2448 
Gin Ser Pro He Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His 
805 810 815 

CTT GAA TGG AAT CCT GAT CTA GAT TGT TCC TGC AGA GAC GGG GAA AAA 2496 
Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys 
820 825 830 

TGT GCA CAT CAT TCC CAT CAT TTC ACC TTG GAT ATT GAT GTT GGA TGT 2544 
Cys Ala His His Ser His His Phe Thr Leu Asp He Asp Val Gly Cys 
835 840 845 

ACA GAC TTA AAT GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATT AAG 2592
Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys
850 855 860

ACG CAA GAT GGC CAT GCA AGA CTA GGG AAT CTA GAG TTT CTC GAA GAG 2640
Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu
865 870 875 880 

AAA CCA TTA TTA GGG GAA GCA CTA GCT CGT GTG AAA AGA GCG GAG AAG 2688 
Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys 
885 890 895 

AAG TGG AGA GAC AAA CGA GAG AAA CTG CAG TTG GAA ACA AAT ATT GTT 2736 
Lys Trp Arg Asp Lys Arg Glu Lys Leu Gin Leu Glu Thr Asn lie Val 
900 905 910 

TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT GTA AAC TCT CAA 2784 
Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gln
915 920 925

TAT GAT AGA TTA CAA GTG GAT ACG AAC ATC GCG ATG ATT CAT GCG GCA 2832
Tyr Asp Arg Leu Gln Val Asp Thr Asn Ile Ala Met Ile His Ala Ala
930 935 940

GAT AAA CGC GTT CAT AGA ATC CGG GAA GCG TAT CTG CCA GAG TTG TCT 2880 
Asp Lys Arg Val His Arg lie Arg Glu Ala Tyr Leu Pro Glu Leu Ser 
945 950 955 960 

GTG ATT CCA GGT GTC AAT GCG GCC ATT TTC GAA GAA TTA GAG GGA CGT 2928 
Val lie Pro Gly Val Asn Ala Ala He Phe Glu Glu Leu Glu Gly Arg 
965 970 975 

p^rjxrjy ipTT ACA GCG TAT TCC TTA TAT GAT GCG AGA AAT GTC ATT AAA AAT 2976 
Ile Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn
980 985 990 

GGC GAT TTC AAT AAT GGC TTA TTA TGC TGG AAC GTG AAA GGT CAT GTA 3024
Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His Val
995 1000 1005 

GAT GTA GAA GAG CAA AAC AAC CAC CGT TCG GTC CTT GTT ATC CCA GAA 3072
Asp Val Glu Glu Gln Asn Asn His Arg Ser Val Leu Val Ile Pro Glu
1010 1015 1020 

TGG GAG GCA GAA GTG TCA CAA GAG GTT CGT GTC TGT CCA GGT CGT GGC 3120
Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly
1025 1030 1035 1040 

TAT ATC CTT CGT GTC ACA GCA TAT AAA GAG GGA TAT GGA GAG GGC TGC 3168 
Tyr He Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys 
1045 1050 1055 

GTA ACG ATC CAT GAG ATC GAA GAC AAT ACA GAC GAA CTG AAA TTC AGC 3216 
Val Thr Ile His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser
1060 1065 1070

AAC TGT GTA GAA GAG GAA GTA TAT CCA AAC AAC ACA GTA ACG TGT AAT 3264
Asn Cys Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn
1075 1080 1085 

AAT TAT ACT GGG ACT CAA GAA GAA TAT GAG GGT ACG TAC ACT TCT CGT 3312 
Asn Tyr Thr Gly Thr Gin Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg 
1090 1095 1100 

AAT CAA GGA TAT GAC GAA GCC TAT GGT AAT AAC CCT TCC GTA CCA GCT 3360 
Asn Gin Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala 
1105 1110 1115 1120 

GAT TAC GCT TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA AGA 3408
Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg 
1125 1130 1135 

GAG AAT CCT TGT GAA TCT AAC AGA GGC TAT GGG GAT TAC ACA CCA CTA 3456 
Glu Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu 
1140 1145 1150 



CCG GCT GGT TAT GTA ACA AAG GAT TTA GAG TAC TTC CCA GAG ACC GAT 3504 
Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr Asp 
1155 1160 1165 

AAG GTA TGG ATT GAG ATC GGA GAA ACA GAA GGA ACA TTC ATC GTG GAT 3552 
Lys Val Trp He Glu He Gly Glu Thr Glu Gly Thr Phe He Val Asp 
1170 1175 1180 

AGC GTG GAA TTA CTC CTT ATG GAG GAA 3579 
Ser Val Glu Leu Leu Leu Met Glu Glu 
1185 1190 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Asn Asn Pro Asn Ile Asn Glu Cys Ile Pro Tyr Asn Cys Leu
1 5 10 15

Ser Asn Pro Glu Val Glu Val Leu Gly Gly Glu Arg Ile Glu Thr Gly
20 25 30

Tyr Thr Pro Ile Asp Ile Ser Leu Ser Leu Thr Gln Phe Leu Leu Ser
35 40 45

Glu Phe Val Pro Gly Ala Gly Phe Val Leu Gly Leu Val Asp Ile Ile
50 55 60

Trp Gly Ile Phe Gly Pro Ser Gln Trp Asp Ala Phe Leu Val Gln Ile
65 70 75 80

Glu Gln Leu Ile Asn Gln Arg Ile Glu Glu Phe Ala Arg Asn Gln Ala
85 90 95

Ile Ser Arg Leu Glu Gly Leu Ser Asn Leu Tyr Gln Ile Tyr Ala Glu
100 105 110

Ser Phe Arg Glu Trp Glu Ala Asp Pro Thr Asn Pro Ala Leu Arg Glu 
115 120 125 

Glu Met Arg Ile Gln Phe Asn Asp Met Asn Ser Ala Leu Thr Thr Ala
130 135 140 

lie Pro Leu Phe Ala Val Gin Asn Tyr Gin Val Pro Leu Leu Ser Val 
145 150 155 160 

Tyr Val Gin Ala Ala Asn Leu His Leu Ser Val Leu Arg Asp Val Ser 
165 170 175 

Val Phe Gly Gln Arg Trp Gly Phe Asp Ala Ala Thr Ile Asn Ser Arg
180 185 190 

Tyr Asn Asp Leu Thr Arg Leu lie Gly Asn Tyr Thr Asp His Ala Val 
195 200 205 

Arg Trp Tyr Asn Thr Gly Leu Glu Arg Val Trp Gly Pro Asp Ser Arg 
210 215 220 

Asp Trp lie Arg Tyr Asn Gin Phe Arg Arg Glu Leu Thr Leu Thr Val 
225 230 235 240 

Leu Asp lie Val Ser Leu Phe Pro Asn Tyr Asp Ser Arg Thr Tyr Pro 
245 250 255 

Ile Arg Thr Val Ser Gln Leu Thr Arg Glu Ile Tyr Thr Asn Pro Val
260 265 270

Leu Glu Asn Phe Asp Gly Ser Phe Arg Gly Ser Ala Gin Gly lie Glu 
275 280 285 

Gly Ser lie Arg Ser Pro His Leu Met Asp lie Leu Asn Ser lie Thr 
290 295 300 

lie Tyr Thr Asp Ala His Arg Gly Glu Tyr Tyr Trp Ser Gly His Gin 
305 310 315 320 

Ile Met Ala Ser Pro Val Gly Phe Ser Gly Pro Glu Phe Thr Phe Pro
325 330 335

Leu Tyr Gly Thr Met Gly Asn Ala Ala Pro Gln Gln Arg Ile Val Ala
340 345 350 

Gln Leu Gly Gln Gly Val Tyr Arg Thr Leu Ser Ser Thr Leu Tyr Arg
355 360 365 

Arg Pro Phe Asn lie Gly lie Asn Asn Gin Gin Leu Ser Val Leu Asp 
370 375 380 

Gly Thr Glu Phe Ala Tyr Gly Thr Ser Ser Asn Leu Pro Ser Ala Val 
385 390 395 400 

Tyr Arg Lys Ser Gly Thr Val Asp Ser Leu Asp Glu lie Pro Pro Gin 
405 410 415 

Asn Asn Asn Val Pro Pro Arg Gin Gly Phe Ser His Arg Leu Ser His 
420 425 430 

Val Ser Met Phe Arg Ser Gly Phe Ser Asn Ser Ser Val Ser lie lie 
435 440 445 

Arg Ala Pro Met Phe Ser Trp lie His Arg Ser Ala Thr Leu Thr Asn 
450 455 460 

Thr lie Asp Pro Glu Arg lie Asn Gin lie Pro Leu Val Lys Gly Phe 
465 470 475 480 

Arg Val Trp Gly Gly Thr Ser Val lie Thr Gly Pro Gly Phe Thr Gly 
485 490 495 

Gly Asp lie Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu Gin 
500 505 510 

Val Asn lie Asn Ser Pro lie Thr Gin Arg Tyr Arg Leu Arg Phe Arg 
515 520 525 

Tyr Ala Ser Ser Arg Asp Ala Arg Val lie Val Leu Thr Gly Ala Ala 
530 535 540 

Ser Thr Gly Val Gly Gly Gin Val Ser Val Asn Met Pro Leu Gin Lys 
545 550 555 560 

Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr Thr
565 570 575 

Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp lie lie Gly 
580 585 590 

Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly Glu
595 600 605

Leu Tyr lie Asp Lys lie Glu lie lie Leu Ala Asp Ala Thr Phe Glu 
610 615 620 

Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu Phe
625 630 635 640 

Thr Ser Ser Asn Gin lie Gly Leu Lys Thr Asp Val Thr Asp Tyr His 
645 650 655 

lie Asp Gin Val Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe Cys 
660 665 670 

Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys Arg 
675 680 685 

Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly Ile
690 695 700

Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr Ile
705 710 715 720

Gin Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro Gly 
725 730 735 

Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gin Lys lie Asp Glu 
740 745 750 

Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr lie Glu 
755 760 765 

Asp Ser Gin Asp Leu Glu lie Tyr Leu lie Arg Tyr Asn Ala Lys His 
770 775 780 

Glu lie Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser Ala 
785 790 795 800 

Gin Ser Pro lie Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro His 
805 810 815 

Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu Lys 
820 825 830 

Cys Ala His His Ser His His Phe Thr Leu Asp lie Asp Val Gly Cys 
835 840 845 

Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile Lys
850 855 860

Thr Gin Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu Glu 
865 870 875 880 

Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu Lys 
885 890 895

Lys Trp Arg Asp Lys Arg Glu Lys Leu Gln Leu Glu Thr Asn Ile Val
900 905 910

Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser Gin 
915 920 925 

Tyr Asp Arg Leu Gin Val Asp Thr Asn lie Ala Met lie His Ala Ala 
930 935 940 

Asp Lys Arg Val His Arg lie Arg Glu Ala Tyr Leu Pro Glu Leu Ser 
945 950 955 960 

Val lie Pro Gly Val Asn Ala Ala lie Phe Glu Glu Leu Glu Gly Arg 
965 970 975 

Ile Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys Asn
980 985 990

Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His Val 
995 1000 1005 



Asp Val Glu Glu Gln Asn Asn His Arg Ser Val Leu Val Ile Pro Glu
1010 1015 1020

Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg Gly
1025 1030 1035 1040

Tyr Ile Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly Cys
1045 1050 1055

Val Thr Ile His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe Ser
1060 1065 1070

Asn Cys Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys Asn
1075 1080 1085

Asn Tyr Thr Gly Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser Arg
1090 1095 1100

Asn Gln Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro Ala
1105 1110 1115 1120

Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg Arg
1125 1130 1135

Glu Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro Leu
1140 1145 1150

Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr Asp
1155 1160 1165

Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val Asp
1170 1175 1180

Ser Val Glu Leu Leu Leu Met Glu Glu
1185 1190



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3468 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3468



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG AAT CAA AAT AAA CAC GGA ATT ATT GGC GCT TCC AAT TGT GGT TGT 48 
Met Asn Gln Asn Lys His Gly Ile Ile Gly Ala Ser Asn Cys Gly Cys
1 5 10 15

GCA TCT GAT GAT GTT GCG AAA TAT CCT TTA GCC AAC AAT CCA TAT TCA 96 
Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser 
20 25 30 

TCT GCT TTA AAT TTA AAT TCT TGT CAA AAT AGT AGT ATT CTC AAC TGG 144 
Ser Ala Leu Asn Leu Asn Ser Cys Gin Asn Ser Ser He Leu Asn Trp 
35 40 45 

ATT AAC ATA ATA GGC GAT GCA GCA AAA GAA GCA GTA TCT ATT GGG ACA 192 
He Asn He He Gly Asp Ala Ala Lys Glu Ala Val Ser He Gly Thr 
50 55 60 

ACC ATA GTC TCT CTT ATC ACA GCA CCT TCT CTT ACT GGA TTA ATT TCA 240
Thr Ile Val Ser Leu Ile Thr Ala Pro Ser Leu Thr Gly Leu Ile Ser
65 70 75 80 

ATA GTA TAT GAC CTT ATA GGT AAA GTA CTA GGA GGT AGT AGT GGA CAA 288 
He Val Tyr Asp Leu He Gly Lys Val Leu Gly Gly Ser Ser Gly Gin 
85 90 95 

TCC ATA TCA GAT TTG TCT ATA TGT GAC TTA TTA TCT ATT ATT GAT TTA 336
Ser Ile Ser Asp Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Asp Leu
100 105 110

CGG GTA AGT CAG AGT GTT TTA AAT GAT GGG ATT GCA GAT TTT AAT GGT 384
Arg Val Ser Gln Ser Val Leu Asn Asp Gly Ile Ala Asp Phe Asn Gly
115 120 125 

TCT GTA CTC TTA TAC AGG AAC TAT TTA GAG GCT CTG GAT AGC TGG AAT 432
Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn 
130 135 140 

AAG AAT CCT AAT TCT GCT TCT GCT GAA GAA CTC CGT ACT CGT TTT AGA 480 
Lys Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg 
145 150 155 160 

ATC GCC GAC TCA GAA TTT GAT AGA ATT TTA ACC CGA GGG TCT TTA ACG 528
Ile Ala Asp Ser Glu Phe Asp Arg Ile Leu Thr Arg Gly Ser Leu Thr
165 170 175 

AAT GGT GGC TCG TTA GCT AGA CAA AAT GCC CAA ATA TTA TTA TTA CCT 576
Asn Gly Gly Ser Leu Ala Arg Gln Asn Ala Gln Ile Leu Leu Leu Pro
180 185 190



TCT TTT GCG AGC GCT GCA TTT TTC CAT TTA TTA CTA CTA AGG GAT GCT 624 
Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg Asp Ala 
195 200 205 

ACT AGA TAT GGC ACT AAT TGG GGG CTA TAC AAT GCT ACA CCT TTT ATA 672 
Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe lie 
210 215 220 

AAT TAT CAA TCA AAA CTA GTA GAG CTT ATT GAA CTA TAT ACT GAT TAT 720 
Asn Tyr Gin Ser Lys Leu Val Glu Leu He Glu Leu Tyr Thr Asp Tyr 
225 230 235 240 

TGC GTA CAT TGG TAT AAT CGA GGT TTC AAC GAA CTA AGA CAA CGA GGC 768 
Cys Val His Trp Tyr Asn Arg Gly Phe Asn Glu Leu Arg Gin Arg Gly 
245 250 255 

ACT AGT GCT ACA GCT TGG TTA GAA TTT CAT AGA TAT CGT AGA GAG ATG 816 
Thr Ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
260 265 270 

ACA TTG ATG GTA TTA GAT ATA GTA GCA TCA TTT TCA AGT CTT GAT ATT 864 
Thr Leu Met Val Leu Asp Ile Val Ala Ser Phe Ser Ser Leu Asp Ile
275 280 285

ACT AAT TAC CCA ATA GAA ACA GAT TTT CAG TTG AGT AGG GTC ATT TAT 912 
Thr Asn Tyr Pro He Glu Thr Asp Phe Gin Leu Ser Arg Val He Tyr 
290 295 300 

ACA GAT CCA ATT GGT TTT GTA CAT CGT AGT AGT CTT AGG GGA GAA AGT 960 
Thr Asp Pro He Gly Phe Val His Arg Ser Ser Leu Arg Gly Glu Ser 
305 310 315 320 

TGG TTT AGC TTT GTT AAT AGA GCT AAT TTC TCA GAT TTA GAA AAT GCA 1008 
Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335

ATA CCT AAT CCT AGA CCG TCT TGG TTT TTA AAT AAT ATG ATT ATA TCT 1056 
lie Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met lie lie Ser 
340 345 350 

ACT GGT TCA CTT ACA TTG CCG GTT AGC CCA AGT ACT GAT AGA GCG AGG 1104 
Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg 
355 360 365 

GTA TGG TAT GGA AGT CGA GAT CGA ATT TCC CCT GCT AAT TCA CAA TTT 1152 
Val Trp Tyr Gly Ser Arg Asp Arg lie Ser Pro Ala Asn Ser Gin Phe 
370 375 380 

ATT ACT GAA CTA ATC TCT GGA CAA CAT ACG ACT GCT ACA CAA ACT ATT 1200
Ile Thr Glu Leu Ile Ser Gly Gln His Thr Thr Ala Thr Gln Thr Ile
385 390 395 400

TTA GGG CGA AAT ATA TTT AGA GTA GAT TCT CAA GCT TGT AAT TTA AAT 1248
Leu Gly Arg Asn Ile Phe Arg Val Asp Ser Gln Ala Cys Asn Leu Asn
405 410 415 

GAT ACC ACA TAT GGA GTG AAT AGG GCG GTA TTT TAT CAT GAT GCG AGT 1296 
Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser 
420 425 430

GAA GGT TCT CAA AGA TCC GTG TAC GAG GGG TAT ATT CGA ACA ACT GGG 1344
Glu Gly Ser Gln Arg Ser Val Tyr Glu Gly Tyr Ile Arg Thr Thr Gly
435 440 445 

ATA GAT AAC CCT AGA GTT CAA AAT ATT AAC ACT TAT TTA CCT GGA GAA 1392 
He Asp Asn Pro Arg Val Gin Asn He Asn Thr Tyr Leu Pro Gly Glu 
450 455 460 

AAT TCA GAT ATC CCA ACT CCA GAA GAC TAT ACT CAT ATA TTA AGC ACA 1440 
Asn Ser Asp He Pro Thr Pro Glu Asp Tyr Thr His He Leu Ser Thr 
465 470 475 480 

ACA ATA AAT TTA ACA GGA GGA CTT AGA CAA GTA GCA TCT AAT CGC CGT 1488 
Thr He Asn Leu Thr Gly Gly Leu Arg Gin Val Ala Ser Asn Arg Arg 
485 490 495 

TCA TCT TTA GTA ATG TAT GGT TGG ACA CAT AAA AGT CTG GCT CGT AAC 1536
Ser Ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn
500 505 510

AAT ACC ATT AAT CCA GAT AGA ATT ACA CAG ATA CCA TTG ACG AAG GTT 1584
Asn Thr Ile Asn Pro Asp Arg Ile Thr Gln Ile Pro Leu Thr Lys Val
515 520 525 

GAT ACC CGA GGC ACA GGT GTT TCT TAT GTG AAT GAT CCA GGA TTT ATA 1632
Asp Thr Arg Gly Thr Gly Val Ser Tyr Val Asn Asp Pro Gly Phe Ile
530 535 540 

GGA GGA GCT CTA CTT CAA AGG ACT GAC CAT GGT TCG CTT GGA GTA TTG 1680
Gly Gly Ala Leu Leu Gln Arg Thr Asp His Gly Ser Leu Gly Val Leu
545 550 555 560

AGG GTC CAA TTT CCA CTT CAC TTA AGA CAA CAA TAT CGT ATT AGA GTC 1728
Arg Val Gln Phe Pro Leu His Leu Arg Gln Gln Tyr Arg Ile Arg Val
565 570 575

CGT TAT GCT TCT ACA ACA AAT ATT CGA TTG AGT GTG AAT GGC AGT TTC 1776
Arg Tyr Ala Ser Thr Thr Asn Ile Arg Leu Ser Val Asn Gly Ser Phe
580 585 590 

GGT ACT ATT TCT CAA AAT CTC CCT AGT ACA ATG AGA TTA GGA GAG GAT 1824
Gly Thr Ile Ser Gln Asn Leu Pro Ser Thr Met Arg Leu Gly Glu Asp
595 600 605

TTA AGA TAC GGA TCT TTT GCT ATA AGA GAG TTT AAT ACT TCT ATT AGA 1872
Leu Arg Tyr Gly Ser Phe Ala Ile Arg Glu Phe Asn Thr Ser Ile Arg
610 615 620

CCC ACT GCA AGT CCG GAC CAA ATT CGA TTG ACA ATA GAA CCA TCT TTT 1920
Pro Thr Ala Ser Pro Asp Gln Ile Arg Leu Thr Ile Glu Pro Ser Phe
625 630 635 640

ATT AGA CAA GAG GTC TAT GTA GAT AGA ATT GAG TTC ATT CCA GTT AAT 1968 
lie Arg Gin Glu Val Tyr Val Asp Arg lie Glu Phe lie Pro Val Asn 
645 650 655 

CCG ACG CGA GAG GCG AAA GAG GAT CTA GAA GCA GCA AAA AAA GCG GTG 2016
Pro Thr Arg Glu Ala Lys Glu Asp Leu Glu Ala Ala Lys Lys Ala Val 
660 665 670 

GCG AGC TTG TTT ACA CGC ACA AGG GAC GGA TTA CAA GTA AAT GTG AAA 2064 
Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gin Val Asn Val Lys 
675 680 685 

GAT TAT CAA GTC GAT CAA GCG GCA AAT TTA GTG TCA TGC TTA TCA GAT 2112 
Asp Tyr Gin Val Asp Gin Ala Ala Asn Leu Val Ser Cys Leu Ser Asp 
690 695 700 

GAA CAA TAT GGG TAT GAC AAA AAG ATG TTA TTG GAA GCG GTA CGT GCG 2160 
Glu Gin Tyr Gly Tyr Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala 
705 710 715 720 

GCA AAA CGA CTT AGC CGA GAA CGC AAC TTA CTT CAG GAT CCA GAT TTT 2208 
Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gln Asp Pro Asp Phe
725 730 735 

AAT ACA ATC AAT AGT ACA GAA GAA AAT GGA TGG AAA GCA AGT AAC GGC 2256 
Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly
740 745 750

GTT ACT ATT AGT GAG GGC GGG CCA TTC TAT AAA GGC CGT GCA ATT CAG 2304
Val Thr Ile Ser Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala Ile Gln
755 760 765 

CTA GCA AGT GCA CGA GAA AAT TAC CCA ACA TAC ATC TAT CAA AAA GTA 2352 
Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr lie Tyr Gin Lys Val 
770 775 780 

GAT GCA TCG GAG TTA AAG CCG TAT ACA CGT TAT AGA CTG GAT GGG TTC 2400 
Asp Ala Ser Glu Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe 
785 790 795 800 

GTG AAG AGT AGT CAA GAT TTA GAA ATT GAT CTC ATT CAC CAT CAT AAA 2448
Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His His Lys
805 810 815

GTC CAT CTT GTG AAA AAT GTA CCA GAT AAT TTA GTA TCT GAT ACT TAC 2496
Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr Tyr
820 825 830

CCA GAT GAT TCT TGT AGT GGA ATC AAT CGA TGT CAG GAA CAA CAG ATG 2544 
Pro Asp Asp Ser Cys Ser Gly lie Asn Arg Cys Gin Glu Gin Gin Met 
835 840 845 

GTA AAT GCG CAA CTG GAA ACA GAG CAT CAT CAT CCG ATG GAT TGC TGT 2592 
Val Asn Ala Gin Leu Glu Thr Glu His His His Pro Met Asp Cys Cys 
850 855 860 

GAA GCA GCT CAA ACA CAT GAG TTT TCT TCC TAT ATT GAT ACA GGG GAT 2640 
Glu Ala Ala Gin Thr His Glu Phe Ser Ser Tyr lie Asp Thr Gly Asp 
865 870 875 880 

TTA AAT TCG AGT GTA GAC CAG GGA ATC TGG GCG ATC TTT AAA GTT CGA 2688 
Leu Asn Ser Ser Val Asp Gin Gly lie Trp Ala lie Phe Lys Val Arg 
885 890 895 

ACA ACC GAT GGT TAT GCG ACG TTA GGA AAT CTT GAA TTG GTA GAG GTC 2736 
Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val 
900 905 910 

GGA CCG TTA TCG GGT GAA TCT TTA GAA CGT GAA CAA AGG GAT AAT ACA 2784
Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gln Arg Asp Asn Thr
915 920 925

AAA TGG AGT GCA GAG CTA GGA AGA AAG CGT GCA GAA ACA GAT CGC GTG 2832 
Lys Trp Ser Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg Val 
930 935 940 

TAT CAA GAT GCC AAA CAA TCC ATC AAT CAT TTA TTT GTG GAT TAT CAA 2880 
Tyr Gin Asp Ala Lys Gin Ser lie Asn His Leu Phe Val Asp Tyr Gin 
945 950 955 960 

GAT CAA CAA TTA AAT CCA GAA ATA GGG ATG GCA GAT ATT ATG GAC GCT 2928 
Asp Gln Gln Leu Asn Pro Glu Ile Gly Met Ala Asp Ile Met Asp Ala
965 970 975 

CAA AAT CTT GTC GCA TCA ATT TCA GAT GTA TAT AGC GAT GCC GTA CTG 2976 
Gin Asn Leu Val Ala Ser lie Ser Asp Val Tyr Ser Asp Ala Val Leu 
980 985 990 

CAA ATC CCT GGA ATT AAC TAT GAG ATT TAC ACA GAG CTG TCC AAT CGC 3024
Gln Ile Pro Gly Ile Asn Tyr Glu Ile Tyr Thr Glu Leu Ser Asn Arg
995 1000 1005 

TTA CAA CAA GCA TCG TAT CTG TAT ACG TCT CGA AAT GCG GTG CAA AAT 3072 
Leu Gin Gin Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gin Asn 
1010 1015 1020 

GGG GAC TTT AAC AAC GGG CTA GAT AGC TGG AAT GCA ACA GCG GGT GCA 3120 
Gly Asp Phe Asn Asn Gly Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala
1025 1030 1035 1040 

TCG GTA CAA CAG GAT GGC AAT ACG CAT TTC TTA GTT CTT TCT CAT TGG 3168 
Ser Val Gln Gln Asp Gly Asn Thr His Phe Leu Val Leu Ser His Trp
1045 1050 1055 

GAT GCA CAA GTT TCT CAA CAA TTT AGA GTG CAG CCG AAT TGT AAA TAT 3216 
Asp Ala Gin Val Ser Gin Gin Phe Arg Val Gin Pro Asn Cys Lys Tyr 
1060 1065 1070 

GTA TTA CGT GTA ACA GCA GAG AAA GTA GGC GGC GGA GAC GGA TAC GTG 3264
Val Leu Arg Val Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val 
1075 1080 1085 

ACT ATC CGG GAT GAT GCT CAT CAT ACA GAA ACG CTT ACA TTT AAT GCA 3312 
Thr He Arg Asp Asp Ala His His Thr Glu Thr Leu Thr Phe Asn Ala 
1090 1095 1100 

TGT GAT TAT GAT ATA AAT GGC ACG TAC GTG ACT GAT AAT ACG TAT CTA 3360
Cys Asp Tyr Asp Ile Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu
1105 1110 1115 1120 

ACA AAA GAA GTG GTA TTC CAT CCG GAG ACA CAA CAC ATG TGG GTA GAG 3408 
Thr Lys Glu Val Val Phe His Pro Glu Thr Gin His Met Trp Val Glu 
1125 1130 1135 

GTA AAT GAA ACA GAA GGT GCA TTT CAT ATA GAT AGT ATT GAA TTC GTT 3456 
Val Asn Glu Thr Glu Gly Ala Phe His He Asp Ser He Glu Phe Val 
1140 1145 1150 

GAA ACA GAA AAG 3468 
Glu Thr Glu Lys 
1155 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1156 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Asn Gln Asn Lys His Gly Ile Ile Gly Ala Ser Asn Cys Gly Cys
1 5 10 15

Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser 
20 25 30

Ser Ala Leu Asn Leu Asn Ser Cys Gln Asn Ser Ser Ile Leu Asn Trp
35 40 45

lie Asn lie lie Gly Asp Ala Ala Lys Glu Ala Val Ser lie Gly Thr 
50 55 60 

Thr lie Val Ser Leu lie Thr Ala Pro Ser Leu Thr Gly Leu lie Ser 
65 70 75 80 

lie Val Tyr Asp Leu lie Gly Lys Val Leu Gly Gly Ser Ser Gly Gin 
85 90 95 

Ser lie Ser Asp Leu Ser lie Cys Asp Leu Leu Ser lie lie Asp Leu 
100 105 110 

Arg Val Ser Gin Ser Val Leu Asn Asp Gly lie Ala Asp Phe Asn Gly 
115 120 125 



Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn
130 135 140

Lys Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg
145 150 155 160

Ile Ala Asp Ser Glu Phe Asp Arg Ile Leu Thr Arg Gly Ser Leu Thr
165 170 175

Asn Gly Gly Ser Leu Ala Arg Gln Asn Ala Gln Ile Leu Leu Leu Pro
180 185 190

Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg Asp Ala
195 200 205

Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe Ile
210 215 220

Asn Tyr Gln Ser Lys Leu Val Glu Leu Ile Glu Leu Tyr Thr Asp Tyr
225 230 235 240

Cys Val His Trp Tyr Asn Arg Gly Phe Asn Glu Leu Arg Gln Arg Gly
245 250 255 

Thr Ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
260 265 270 

Thr Leu Met Val Leu Asp Ile Val Ala Ser Phe Ser Ser Leu Asp Ile
275 280 285 

Thr Asn Tyr Pro lie Glu Thr Asp Phe Gin Leu Ser Arg Val lie Tyr 
290 295 300 

Thr Asp Pro lie Gly Phe Val His Arg Ser Ser Leu Arg Gly Glu Ser 
305 310 315 320 

Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335 

lie Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met He He Ser 
340 345 350 

Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg 
355 360 365 

Val Trp Tyr Gly Ser Arg Asp Arg He Ser Pro Ala Asn Ser Gin Phe 
370 375 380 

He Thr Glu Leu He Ser Gly Gin His Thr Thr Ala Thr Gin Thr He 
385 390 395 400 

Leu Gly Arg Asn He Phe Arg Val Asp Ser Gin Ala Cys Asn Leu Asn 
405 410 415 

Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser 
420 425 430

Glu Gly Ser Gin Arg Ser Val Tyr Glu Gly Tyr He Arg Thr Thr Gly 
435 440 445 

He Asp Asn Pro Arg Val Gin Asn He Asn Thr Tyr Leu Pro Gly Glu 
450 455 460 

Asn Ser Asp He Pro Thr Pro Glu Asp Tyr Thr His He Leu Ser Thr 
465 470 475 480 

Thr He Asn Leu Thr Gly Gly Leu Arg Gin Val Ala Ser Asn Arg Arg 
485 490 495 

Ser Ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn 
500 505 510 

Asn Thr Ile Asn Pro Asp Arg Ile Thr Gln Ile Pro Leu Thr Lys Val
515 520 525

Asp Thr Arg Gly Thr Gly Val Ser Tyr Val Asn Asp Pro Gly Phe lie 
530 535 540 

Gly Gly Ala Leu Leu Gin Arg Thr Asp His Gly Ser Leu Gly Val Leu 
545 550 555 560 

Arg Val Gin Phe Pro Leu His Leu Arg Gin Gin Tyr Arg lie Arg Val 
565 570 575 

Arg Tyr Ala Ser Thr Thr Asn lie Arg Leu Ser Val Asn Gly Ser Phe 
580 585 590 

Gly Thr Ile Ser Gln Asn Leu Pro Ser Thr Met Arg Leu Gly Glu Asp
595 600 605

Leu Arg Tyr Gly Ser Phe Ala Ile Arg Glu Phe Asn Thr Ser Ile Arg
610 615 620

Pro Thr Ala Ser Pro Asp Gln Ile Arg Leu Thr Ile Glu Pro Ser Phe
625 630 635 640

Ile Arg Gln Glu Val Tyr Val Asp Arg Ile Glu Phe Ile Pro Val Asn
645 650 655

Pro Thr Arg Glu Ala Lys Glu Asp Leu Glu Ala Ala Lys Lys Ala Val 
660 665 670 

Ala Ser Leu Phe Thr Arg Thr Arg Asp Gly Leu Gln Val Asn Val Lys
675 680 685

Asp Tyr Gln Val Asp Gln Ala Ala Asn Leu Val Ser Cys Leu Ser Asp
690 695 700

Glu Gin Tyr Gly Tyr Asp Lys Lys Met Leu Leu Glu Ala Val Arg Ala 
705 710 715 720 

Ala Lys Arg Leu Ser Arg Glu Arg Asn Leu Leu Gin Asp Pro Asp Phe 
725 730 735 

Asn Thr Ile Asn Ser Thr Glu Glu Asn Gly Trp Lys Ala Ser Asn Gly
740 745 750

Val Thr Ile Ser Glu Gly Gly Pro Phe Tyr Lys Gly Arg Ala Ile Gln
755 760 765

Leu Ala Ser Ala Arg Glu Asn Tyr Pro Thr Tyr Ile Tyr Gln Lys Val
770 775 780

Asp Ala Ser Glu Leu Lys Pro Tyr Thr Arg Tyr Arg Leu Asp Gly Phe
785 790 795 800

Val Lys Ser Ser Gln Asp Leu Glu Ile Asp Leu Ile His His His Lys
805 810 815

Val His Leu Val Lys Asn Val Pro Asp Asn Leu Val Ser Asp Thr Tyr 
820 825 830 

Pro Asp Asp Ser Cys Ser Gly Ile Asn Arg Cys Gln Glu Gln Gln Met
835 840 845

Val Asn Ala Gin Leu Glu Thr Glu His His His Pro Met Asp Cys Cys 
850 855 860 

Glu Ala Ala Gln Thr His Glu Phe Ser Ser Tyr Ile Asp Thr Gly Asp
865 870 875 880

Leu Asn Ser Ser Val Asp Gln Gly Ile Trp Ala Ile Phe Lys Val Arg
885 890 895

Thr Thr Asp Gly Tyr Ala Thr Leu Gly Asn Leu Glu Leu Val Glu Val 
900 905 910 

Gly Pro Leu Ser Gly Glu Ser Leu Glu Arg Glu Gin Arg Asp Asn Thr 
915 920 925 

Lys Trp Ser Ala Glu Leu Gly Arg Lys Arg Ala Glu Thr Asp Arg Val
930 935 940 

Tyr Gin Asp Ala Lys Gin Ser lie Asn His Leu Phe Val Asp Tyr Gin 
945 950 955 960 

Asp Gin Gin Leu Asn Pro Glu lie Gly Met Ala Asp lie Met Asp Ala 
965 970 975 

Gin Asn Leu Val Ala Ser lie Ser Asp Val Tyr Ser Asp Ala Val Leu 
980 985 990 

Gin lie Pro Gly lie Asn Tyr Glu lie Tyr Thr Glu Leu Ser Asn Arg 
995 1000 1005 

Leu Gln Gln Ala Ser Tyr Leu Tyr Thr Ser Arg Asn Ala Val Gln Asn
1010 1015 1020 

Gly Asp Phe Asn Asn Gly Leu Asp Ser Trp Asn Ala Thr Ala Gly Ala 
1025 1030 1035 1040

Ser Val Gin Gin Asp Gly Asn Thr His Phe Leu Val Leu Ser His Trp 
1045 1050 1055 

Asp Ala Gin Val Ser Gin Gin Phe Arg Val Gin Pro Asn Cys Lys Tyr 
1060 1065 1070 

Val Leu Arg Val Thr Ala Glu Lys Val Gly Gly Gly Asp Gly Tyr Val 
1075 1080 1085

Thr lie Arg Asp Asp Ala His His Thr Glu Thr Leu Thr Phe Asn Ala 
1090 1095 1100 

Cys Asp Tyr Asp lie Asn Gly Thr Tyr Val Thr Asp Asn Thr Tyr Leu 
1105 1110 1115 1120 

Thr Lys Glu Val Val Phe His Pro Glu Thr Gin His Met Trp Val Glu 
1125 1130 1135 

Val Asn Glu Thr Glu Gly Ala Phe His lie Asp Ser He Glu Phe Val 
1140 1145 1150 

Glu Thr Glu Lys
1155 



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3726



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG AAT CAA AAT AAA CAC GGA ATT ATT GGC GCT TCC AAT TGT GGT TGT 48
Met Asn Gln Asn Lys His Gly Ile Ile Gly Ala Ser Asn Cys Gly Cys
1 5 10 15

GCA TCT GAT GAT GTT GCG AAA TAT CCT TTA GCC AAC AAT CCA TAT TCA 96
Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser
20 25 30

TCT GCT TTA AAT TTA AAT TCT TGT CAA AAT AGT AGT ATT CTC AAC TGG 144
Ser Ala Leu Asn Leu Asn Ser Cys Gln Asn Ser Ser Ile Leu Asn Trp
35 40 45

ATT AAC ATA ATA GGC GAT GCA GCA AAA GAA GCA GTA TCT ATT GGG ACA 192
Ile Asn Ile Ile Gly Asp Ala Ala Lys Glu Ala Val Ser Ile Gly Thr
50 55 60

ACC ATA GTC TCT CTT ATC ACA GCA CCT TCT CTT ACT GGA TTA ATT TCA 240
Thr Ile Val Ser Leu Ile Thr Ala Pro Ser Leu Thr Gly Leu Ile Ser
65 70 75 80 

ATA GTA TAT GAC CTT ATA GGT AAA GTA CTA GGA GGT AGT AGT GGA CAA 288
Ile Val Tyr Asp Leu Ile Gly Lys Val Leu Gly Gly Ser Ser Gly Gln
85 90 95

TCC ATA TCA GAT TTG TCT ATA TGT GAC TTA TTA TCT ATT ATT GAT TTA 336
Ser Ile Ser Asp Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Asp Leu
100 105 110

CGG GTA AGT CAG AGT GTT TTA AAT GAT GGG ATT GCA GAT TTT AAT GGT 384
Arg Val Ser Gln Ser Val Leu Asn Asp Gly Ile Ala Asp Phe Asn Gly
115 120 125

TCT GTA CTC TTA TAC AGG AAC TAT TTA GAG GCT CTG GAT AGC TGG AAT 432
Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn
130 135 140

AAG AAT CCT AAT TCT GCT TCT GCT GAA GAA CTC CGT ACT CGT TTT AGA 480
Lys Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg
145 150 155 160

ATC GCC GAC TCA GAA TTT GAT AGA ATT TTA ACC CGA GGG TCT TTA ACG 528
Ile Ala Asp Ser Glu Phe Asp Arg Ile Leu Thr Arg Gly Ser Leu Thr
165 170 175

AAT GGT GGG TCG TTA GCT AGA CAA AAT GCC CAA ATA TTA TTA TTA CCT 576
Asn Gly Gly Ser Leu Ala Arg Gln Asn Ala Gln Ile Leu Leu Leu Pro
180 185 190

TCT TTT GCG AGC GCT GCA TTT TTC CAT TTA TTA CTA CTA AGG GAT GCT 624
Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg Asp Ala
195 200 205

ACT AGA TAT GGC ACT AAT TGG GGG CTA TAC AAT GCT ACA CCT TTT ATA 672
Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe Ile
210 215 220 

AAT TAT CAA TCA AAA CTA GTA GAG CTT ATT GAA CTA TAT ACT GAT TAT 720 
Asn Tyr Gin Ser Lys Leu Val Glu Leu lie Glu Leu Tyr Thr Asp Tyr 
225 230 235 240 

TGC GTA CAT TGG TAT AAT CGA GGT TTC AAC GAA CTA AGA CAA CGA GGC 768
Cys Val His Trp Tyr Asn Arg Gly Phe Asn Glu Leu Arg Gln Arg Gly
245 250 255 

ACT AGT GCT ACA GCT TGG TTA GAA TTT CAT AGA TAT CGT AGA GAG ATG 816 
Thr Ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
260 265 270 

ACA TTG ATG GTA TTA GAT ATA GTA GCA TCA TTT TCA AGT CTT GAT ATT 864 
Thr Leu Met Val Leu Asp lie Val Ala Ser Phe Ser Ser Leu Asp lie 
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275 



280 



285 



ACT AAT TAC CCA ATA GAA ACA GAT TTT CAG TTG AGT AGG GTC ATT TAT 
Thr Asn Tyr Pro lie Glu Thr Asp Phe Gin Leu Ser Arg Val lie Tyr 
290 295 300 



912 



ACA GAT CCA ATT GGT TTT GTA CAT CGT AGT AGT CTT AGG GGA GAA AGT 
Thr Asp Pro lie Gly Phe Val His Arg Ser Ser Leu Arg Gly Glu Ser 
305 310 315 320 



960 



TGG TTT AGC TTT GTT AAT AG A GCT AAT TTC TCA GAT TTA GAA AAT GCA 
Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335 



1008 



ATA CCT AAT CCT AGA CCG TCT TGG TTT TTA AAT AAT ATG ATT ATA TCT 1056
Ile Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met Ile Ile Ser
340 345 350

ACT GGT TCA CTT ACA TTG CCG GTT AGC CCA AGT ACT GAT AGA GCG AGG 1104
Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg 
355 360 365 

GTA TGG TAT GGA AGT CGA GAT CGA ATT TCC CCT GCT AAT TCA CAA TTT 1152 
Val Trp Tyr Gly Ser Arg Asp Arg Ile Ser Pro Ala Asn Ser Gln Phe
370 375 380 



ATT ACT GAA CTA ATC TCT GGA CAA CAT ACG ACT GCT ACA CAA ACT ATT 1200
Ile Thr Glu Leu Ile Ser Gly Gln His Thr Thr Ala Thr Gln Thr Ile
385 390 395 400

TTA GGG CGA AAT ATA TTT AGA GTA GAT TCT CAA GCT TGT AAT TTA AAT 1248
Leu Gly Arg Asn Ile Phe Arg Val Asp Ser Gln Ala Cys Asn Leu Asn
405 410 415

GAT ACC ACA TAT GGA GTG AAT AGG GCG GTA TTT TAT CAT GAT GCG AGT 1296
Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser
420 425 430

GAA GGT TCT CAA AGA TCC GTG TAC GAG GGG TAT ATT CGA ACA ACT GGG 1344
Glu Gly Ser Gln Arg Ser Val Tyr Glu Gly Tyr Ile Arg Thr Thr Gly
435 440 445

ATA GAT AAC CCT AGA GTT CAA AAT ATT AAC ACT TAT TTA CCT GGA GAA 1392
Ile Asp Asn Pro Arg Val Gln Asn Ile Asn Thr Tyr Leu Pro Gly Glu
450 455 460

AAT TCA GAT ATC CCA ACT CCA GAA GAC TAT ACT CAT ATA TTA AGC ACA 1440
Asn Ser Asp Ile Pro Thr Pro Glu Asp Tyr Thr His Ile Leu Ser Thr
465 470 475 480

ACA ATA AAT TTA AGA GGA GGA CTT AGA CAA GTA GCA TCT AAT CGC CGT 1488
Thr Ile Asn Leu Thr Gly Gly Leu Arg Gln Val Ala Ser Asn Arg Arg
485 490 495

TCA TCT TTA GTA ATG TAT GGT TGG ACA CAT AAA AGT CTG GCT CGT AAC 1536
Ser Ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn 
500 505 510 

AAT ACC ATT AAT CCA GAT AGA ATT ACA CAG ATA CCT TTA GTG AAA GGA 1584 
Asn Thr Ile Asn Pro Asp Arg Ile Thr Gln Ile Pro Leu Val Lys Gly
515 520 525

TTT AGA GTT TGG GGG GGC ACC TCT GTC ATT ACA GGA CCA GGA TTT ACA 1632 
Phe Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr
530 535 540 

GGA GGG GAT ATC CTT CGA AGA AAT ACC TTT GGT GAT TTT GTA TCT CTA 1680 
Gly Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu
545 550 555 560 

CAA GTC AAT ATT AAT TCA CCA ATT ACC CAA AGA TAC CGT TTA AGA TTT 1728
Gln Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe
565 570 575 

CGT TAC GCT TCC AGT AGG GAT GCA CGA GTT ATA GTA TTA ACA GGA GCG 1776 
Arg Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala
580 585 590

GCA TCC ACA GGA GTG GGA GGC CAA GTT AGT GTA AAT ATG CCT CTT CAG 1824 
Ala Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln
595 600 605 

AAA ACT ATG GAA ATA GGG GAG AAC TTA ACA TCT AGA ACA TTT AGA TAT 1872 
Lys Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr
610 615 620 

ACC GAT TTT AGT AAT CCT TTT TCA TTT AGA GCT AAT CCA GAT ATA ATT 1920
Thr Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile
625 630 635 640 

GGG ATA AGT GAA CAA CCT CTA TTT GGT GCA GGT TCT ATT AGT AGC GGT 1968 
Gly Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly
645 650 655 

GAA CTT TAT ATA GAT AAA ATT GAA ATT ATT CTA GCA GAT GCA ACA TTT 2016 
Glu Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe
660 665 670 

GAA GCA GAA TCT GAT TTA GAA AGA GCA CAA AAG GCG GTG AAT GCC CTG 2064 
Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu
675 680 685 

TTT ACT TCT TCC AAT CAA ATC GGG TTA AAA ACC GAT GTG ACG GAT TAT 2112 
Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr
690 695 700

CAT ATT GAT CAA GTA TCC AAT TTA GTG GAT TGT TTA TCA GAT GAA TTT 2160
His Ile Asp Gln Val Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe
705 710 715 720 

TGT CTG GAT GAA AAG CGA GAA TTG TCC GAG AAA GTC AAA CAT GCG AAG 2208 
Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys 
725 730 735 

CGA CTC AGT GAT GAG CGG AAT TTA CTT CAA GAT CCA AAC TTC AGA GGG 2256 
Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly
740 745 750

ATC AAT AGA CAA CCA GAC CGT GGC TGG AGA GGA AGT ACA GAT ATT ACC 2304 
Ile Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr
755 760 765

ATC CAA GGA GGA GAT GAC GTA TTC AAA GAG AAT TAC GTC ACA CTA CCG 2352
Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro
770 775 780 

GGT ACC GTT GAT GAG TGC TAT CCA ACG TAT TTA TAT CAG AAA ATA GAT 2400 
Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp
785 790 795 800

GAG TCG AAA TTA AAA GCT TAT ACC CGT TAT GAA TTA AGA GGG TAT ATC 2448 
Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr Ile
805 810 815 

GAA GAT AGT CAA GAC TTA GAA ATC TAT TTG ATC CGT TAC AAT GCA AAA 2496 
Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys
820 825 830 

CAC GAA ATA GTA AAT GTG CCA GGC ACG GGT TCC TTA TGG CCG CTT TCA 2544 
His Glu Ile Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser
835 840 845 

GCC CAA AGT CCA ATC GGA AAG TGT GGA GAA CCG AAT CGA TGC GCG CCA 2592 
Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro
850 855 860 

CAC CTT GAA TGG AAT CCT GAT CTA GAT TGT TCC TGC AGA GAC GGG GAA 2640
His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu 
865 870 875 880 

AAA TGT GCA CAT CAT TCC CAT CAT TTC ACC TTG GAT ATT GAT GTT GGA 2688 
Lys Cys Ala His His Ser His His Phe Thr Leu Asp Ile Asp Val Gly
885 890 895 

TGT ACA GAC TTA AAT GAG GAC TTA GGT GTA TGG GTG ATA TTC AAG ATT 2736 
Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile
900 905 910 

AAG ACG CAA GAT GGC CAT GCA AGA CTA GGG AAT CTA GAG TTT CTC GAA 2784
Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu
915 920 925 

GAG AAA CCA TTA TTA GGG GAA GCA CTA GCT CGT GTG AAA AGA GCG GAG 2832 
Glu Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu 
930 935 940 

AAG AAG TGG AGA GAG AAA CGA GAG AAA CTG CAG TTG GAA ACA AAT ATT 2880 
Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gln Leu Glu Thr Asn Ile
945 950 955 960 

GTT TAT AAA GAG GCA AAA GAA TCT GTA GAT GCT TTA TTT GTA AAC TCT 2928
Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser 
965 970 975 

CAA TAT GAT AGA TTA CAA GTG GAT ACG AAC ATC GCG ATG ATT CAT GCG 2976 
Gln Tyr Asp Arg Leu Gln Val Asp Thr Asn Ile Ala Met Ile His Ala
980 985 990

GCA GAT AAA CGC GTT CAT AGA ATC CGG GAA GCG TAT CTG CCA GAG TTG 3024 
Ala Asp Lys Arg Val His Arg Ile Arg Glu Ala Tyr Leu Pro Glu Leu
995 1000 1005 

TCT GTG ATT CCA GGT GTC AAT GCG GCC ATT TTC GAA GAA TTA GAG GGA 3072 
Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly
1010 1015 1020 

CGT ATT TTT ACA GCG TAT TCC TTA TAT GAT GCG AGA AAT GTC ATT AAA 3120 
Arg Ile Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys
1025 1030 1035 1040 

AAT GGC GAT TTC AAT AAT GGC TTA TTA TGC TGG AAC GTG AAA GGT CAT 3168 
Asn Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His 
1045 1050 1055 

GTA GAT GTA GAA GAG CAA AAC AAC CAC CGT TCG GTC CTT GTT ATC CCA 3216 
Val Asp Val Glu Glu Gln Asn Asn His Arg Ser Val Leu Val Ile Pro
1060 1065 1070 

GAA TGG GAG GCA GAA GTG TCA CAA GAG GTT CGT GTC TGT CCA GGT CGT 3264 
Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg
1075 1080 1085 

GGC TAT ATC CTT CGT GTC ACA GCA TAT AAA GAG GGA TAT GGA GAG GGC 3312 
Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly
1090 1095 1100 

TGC GTA ACG ATC CAT GAG ATC GAA GAC AAT ACA GAC GAA CTG AAA TTC 3360 
Cys Val Thr Ile His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe
1105 1110 1115 1120 

AGC AAC TGT GTA GAA GAG GAA GTA TAT CCA AAC AAC ACA GTA ACG TGT 3408
Ser Asn Cys Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys
1125 1130 1135

AAT AAT TAT ACT GGG ACT CAA GAA GAA TAT GAG GGT ACG TAC ACT TCT 3456
Asn Asn Tyr Thr Gly Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser
1140 1145 1150 

CGT AAT CAA GGA TAT GAC GAA GCC TAT GGT AAT AAC CCT TCC GTA CCA 3504 
Arg Asn Gln Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro
1155 1160 1165 

GCT GAT TAC GCT TCA GTC TAT GAA GAA AAA TCG TAT ACA GAT GGA CGA 3552
Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg 
1170 1175 1180 

AGA GAG AAT CCT TGT GAA TCT AAC AGA GGC TAT GGG GAT TAC ACA CCA 3600
Arg Glu Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro
1185 1190 1195 1200

CTA CCG GCT GGT TAT GTA ACA AAG GAT TTA GAG TAC TTC CCA GAG ACC 3648
Leu Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr 
1205 1210 1215 

GAT AAG GTA TGG ATT GAG ATC GGA GAA ACA GAA GGA ACA TTC ATC GTG 3696
Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val
1220 1225 1230 

GAT AGC GTG GAA TTA CTC CTT ATG GAG GAA 3726 
Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1235 1240 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Asn Gln Asn Lys His Gly Ile Ile Gly Ala Ser Asn Cys Gly Cys
1 5 10 15

Ala Ser Asp Asp Val Ala Lys Tyr Pro Leu Ala Asn Asn Pro Tyr Ser 
20 25 30 

Ser Ala Leu Asn Leu Asn Ser Cys Gln Asn Ser Ser Ile Leu Asn Trp
35 40 45 

Ile Asn Ile Ile Gly Asp Ala Ala Lys Glu Ala Val Ser Ile Gly Thr
50 55 60

Thr Ile Val Ser Leu Ile Thr Ala Pro Ser Leu Thr Gly Leu Ile Ser
65 70 75 80 

Ile Val Tyr Asp Leu Ile Gly Lys Val Leu Gly Gly Ser Ser Gly Gln
85 90 95 

Ser Ile Ser Asp Leu Ser Ile Cys Asp Leu Leu Ser Ile Ile Asp Leu
100 105 110 

Arg Val Ser Gln Ser Val Leu Asn Asp Gly Ile Ala Asp Phe Asn Gly
115 120 125 

Ser Val Leu Leu Tyr Arg Asn Tyr Leu Glu Ala Leu Asp Ser Trp Asn 
130 135 140

Lys Asn Pro Asn Ser Ala Ser Ala Glu Glu Leu Arg Thr Arg Phe Arg 
145 150 155 160 

Ile Ala Asp Ser Glu Phe Asp Arg Ile Leu Thr Arg Gly Ser Leu Thr
165 170 175 

Asn Gly Gly Ser Leu Ala Arg Gln Asn Ala Gln Ile Leu Leu Leu Pro
180 185 190 

Ser Phe Ala Ser Ala Ala Phe Phe His Leu Leu Leu Leu Arg Asp Ala
195 200 205 

Thr Arg Tyr Gly Thr Asn Trp Gly Leu Tyr Asn Ala Thr Pro Phe Ile
210 215 220 

Asn Tyr Gln Ser Lys Leu Val Glu Leu Ile Glu Leu Tyr Thr Asp Tyr
225 230 235 240 

Cys Val His Trp Tyr Asn Arg Gly Phe Asn Glu Leu Arg Gln Arg Gly
245 250 255 

Thr Ser Ala Thr Ala Trp Leu Glu Phe His Arg Tyr Arg Arg Glu Met 
260 265 270 

Thr Leu Met Val Leu Asp Ile Val Ala Ser Phe Ser Ser Leu Asp Ile
275 280 285

Thr Asn Tyr Pro Ile Glu Thr Asp Phe Gln Leu Ser Arg Val Ile Tyr
290 295 300

Thr Asp Pro Ile Gly Phe Val His Arg Ser Ser Leu Arg Gly Glu Ser
305 310 315 320 

Trp Phe Ser Phe Val Asn Arg Ala Asn Phe Ser Asp Leu Glu Asn Ala 
325 330 335

Ile Pro Asn Pro Arg Pro Ser Trp Phe Leu Asn Asn Met Ile Ile Ser
340 345 350

Thr Gly Ser Leu Thr Leu Pro Val Ser Pro Ser Thr Asp Arg Ala Arg
355 360 365

Val Trp Tyr Gly Ser Arg Asp Arg Ile Ser Pro Ala Asn Ser Gln Phe
370 375 380

Ile Thr Glu Leu Ile Ser Gly Gln His Thr Thr Ala Thr Gln Thr Ile
385 390 395 400

Leu Gly Arg Asn Ile Phe Arg Val Asp Ser Gln Ala Cys Asn Leu Asn
405 410 415

Asp Thr Thr Tyr Gly Val Asn Arg Ala Val Phe Tyr His Asp Ala Ser
420 425 430

Glu Gly Ser Gln Arg Ser Val Tyr Glu Gly Tyr Ile Arg Thr Thr Gly
435 440 445

Ile Asp Asn Pro Arg Val Gln Asn Ile Asn Thr Tyr Leu Pro Gly Glu
450 455 460

Asn Ser Asp Ile Pro Thr Pro Glu Asp Tyr Thr His Ile Leu Ser Thr
465 470 475 480

Thr Ile Asn Leu Thr Gly Gly Leu Arg Gln Val Ala Ser Asn Arg Arg
485 490 495

Ser Ser Leu Val Met Tyr Gly Trp Thr His Lys Ser Leu Ala Arg Asn
500 505 510

Asn Thr Ile Asn Pro Asp Arg Ile Thr Gln Ile Pro Leu Val Lys Gly
515 520 525

Phe Arg Val Trp Gly Gly Thr Ser Val Ile Thr Gly Pro Gly Phe Thr
530 535 540

Gly Gly Asp Ile Leu Arg Arg Asn Thr Phe Gly Asp Phe Val Ser Leu
545 550 555 560

Gln Val Asn Ile Asn Ser Pro Ile Thr Gln Arg Tyr Arg Leu Arg Phe
565 570 575

Arg Tyr Ala Ser Ser Arg Asp Ala Arg Val Ile Val Leu Thr Gly Ala
580 585 590

Ala Ser Thr Gly Val Gly Gly Gln Val Ser Val Asn Met Pro Leu Gln
595 600 605



Lys Thr Met Glu Ile Gly Glu Asn Leu Thr Ser Arg Thr Phe Arg Tyr
610 615 620 

Thr Asp Phe Ser Asn Pro Phe Ser Phe Arg Ala Asn Pro Asp Ile Ile
625 630 635 640 

Gly Ile Ser Glu Gln Pro Leu Phe Gly Ala Gly Ser Ile Ser Ser Gly
645 650 655

Glu Leu Tyr Ile Asp Lys Ile Glu Ile Ile Leu Ala Asp Ala Thr Phe
660 665 670 

Glu Ala Glu Ser Asp Leu Glu Arg Ala Gln Lys Ala Val Asn Ala Leu
675 680 685 

Phe Thr Ser Ser Asn Gln Ile Gly Leu Lys Thr Asp Val Thr Asp Tyr
690 695 700 

His Ile Asp Gln Val Ser Asn Leu Val Asp Cys Leu Ser Asp Glu Phe
705 710 715 720 

Cys Leu Asp Glu Lys Arg Glu Leu Ser Glu Lys Val Lys His Ala Lys 
725 730 735 

Arg Leu Ser Asp Glu Arg Asn Leu Leu Gln Asp Pro Asn Phe Arg Gly
740 745 750

Ile Asn Arg Gln Pro Asp Arg Gly Trp Arg Gly Ser Thr Asp Ile Thr
755 760 765

Ile Gln Gly Gly Asp Asp Val Phe Lys Glu Asn Tyr Val Thr Leu Pro
770 775 780

Gly Thr Val Asp Glu Cys Tyr Pro Thr Tyr Leu Tyr Gln Lys Ile Asp
785 790 795 800

Glu Ser Lys Leu Lys Ala Tyr Thr Arg Tyr Glu Leu Arg Gly Tyr Ile
805 810 815

Glu Asp Ser Gln Asp Leu Glu Ile Tyr Leu Ile Arg Tyr Asn Ala Lys
820 825 830 

His Glu Ile Val Asn Val Pro Gly Thr Gly Ser Leu Trp Pro Leu Ser
835 840 845 

Ala Gln Ser Pro Ile Gly Lys Cys Gly Glu Pro Asn Arg Cys Ala Pro
850 855 860

His Leu Glu Trp Asn Pro Asp Leu Asp Cys Ser Cys Arg Asp Gly Glu 
865 870 875 880 

Lys Cys Ala His His Ser His His Phe Thr Leu Asp Ile Asp Val Gly
885 890 895 

Cys Thr Asp Leu Asn Glu Asp Leu Gly Val Trp Val Ile Phe Lys Ile
900 905 910

Lys Thr Gln Asp Gly His Ala Arg Leu Gly Asn Leu Glu Phe Leu Glu
915 920 925 

Glu Lys Pro Leu Leu Gly Glu Ala Leu Ala Arg Val Lys Arg Ala Glu 
930 935 940 

Lys Lys Trp Arg Asp Lys Arg Glu Lys Leu Gln Leu Glu Thr Asn Ile
945 950 955 960 

Val Tyr Lys Glu Ala Lys Glu Ser Val Asp Ala Leu Phe Val Asn Ser 
965 970 975 

Gln Tyr Asp Arg Leu Gln Val Asp Thr Asn Ile Ala Met Ile His Ala
980 985 990

Ala Asp Lys Arg Val His Arg Ile Arg Glu Ala Tyr Leu Pro Glu Leu
995 1000 1005 

Ser Val Ile Pro Gly Val Asn Ala Ala Ile Phe Glu Glu Leu Glu Gly
1010 1015 1020 

Arg Ile Phe Thr Ala Tyr Ser Leu Tyr Asp Ala Arg Asn Val Ile Lys
1025 1030 1035 1040 

Asn Gly Asp Phe Asn Asn Gly Leu Leu Cys Trp Asn Val Lys Gly His 
1045 1050 1055 

Val Asp Val Glu Glu Gln Asn Asn His Arg Ser Val Leu Val Ile Pro
1060 1065 1070 

Glu Trp Glu Ala Glu Val Ser Gln Glu Val Arg Val Cys Pro Gly Arg
1075 1080 1085 

Gly Tyr Ile Leu Arg Val Thr Ala Tyr Lys Glu Gly Tyr Gly Glu Gly
1090 1095 1100 

Cys Val Thr Ile His Glu Ile Glu Asp Asn Thr Asp Glu Leu Lys Phe
1105 1110 1115 1120 

Ser Asn Cys Val Glu Glu Glu Val Tyr Pro Asn Asn Thr Val Thr Cys 
1125 1130 1135 

Asn Asn Tyr Thr Gly Thr Gln Glu Glu Tyr Glu Gly Thr Tyr Thr Ser
1140 1145 1150 

Arg Asn Gln Gly Tyr Asp Glu Ala Tyr Gly Asn Asn Pro Ser Val Pro
1155 1160 1165 

Ala Asp Tyr Ala Ser Val Tyr Glu Glu Lys Ser Tyr Thr Asp Gly Arg 
1170 1175 1180 

Arg Glu Asn Pro Cys Glu Ser Asn Arg Gly Tyr Gly Asp Tyr Thr Pro 
1185 1190 1195 1200

Leu Pro Ala Gly Tyr Val Thr Lys Asp Leu Glu Tyr Phe Pro Glu Thr 
1205 1210 1215 



Asp Lys Val Trp Ile Glu Ile Gly Glu Thr Glu Gly Thr Phe Ile Val
1220 1225 1230 



Asp Ser Val Glu Leu Leu Leu Met Glu Glu 
1235 1240 



(2) INFORMATION FOR SEQ ID NO: 13:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "BglII site downstream of
translation termination codon of CryIC."



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATAAGATCTG TT 

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCTAGCCATG GATCAAAATA AACACGGAAT TATTG 35

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTGGTCAGAT CTTTGAAGTA GAGCTCC 27